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Welcome  to  Freiburg  im  Breisgau,  the  site  of  the  29th  International 
Symposium  on  Multiple-Valued  Logic.  Freiburg  is  a  period  middle-ages  city 
founded  by  the  Dukes  of  Zahringen  (the  right  to  hold  markets  was  granted  in 
1120  A.D.).  Its  historical  development  was  especially  influenced  by  its  more  than 
400-year-long  association  with  the  Habsburg  Dynasty.  The  city  developed  from  a 
walled  middle-ages  market  town  to  a  fort  city  (17th/18th  century)  to  the  present- 
day  modern  city  of  about  200,000  inhabitants.  It  is  the  first  time  that  the 
symposium  is  being  held  in  Germany. 

Like  in  the  past  years  researchers  from  different  areas  discuss  the  latest  results 
in  the  field  of  multiple-valued  logic.  The  symposium  is  co-sponsored  by  the 
Albert-Ludwigs-University,  Freiburg,  and  the  IEEE  Computer  Society. 

We  wish  to  express  our  gratitude  to  the  Symposium  Committee  for  their  hard 
work  in  preparing  this  event.  Dr.  Elena  Dubrova  served  as  a  Program  Chair  for 
Europe  and  Africa,  Dr.  Takahiro  Hanyu  served  as  a  Program  Chair  for  Asia  and 
Australia,  and  Prof.  Michael  Miller  served  as  a  Program  Chair  for  the  Americas.  We 
would  like  to  thank  them  for  organizing  an  excellent  program.  ISMVL’99  has  been 
the  joint  effort  of  many  people.  We  especially  like  to  thank  Nicole  Drechsler 
(Financial  Chair),  Frank  Schmiedle  (Publication  Chair),  and  Dr.  Christoph  Scholl 
(Local  Arrangement  Chair). 

We  would  also  like  to  thank  Thomas  Baldwin  of  the  IEEE  Computer  Society  for  his 
work  in  publishing  this  volume. 

Finally,  we  would  like  to  wish  all  participants  a  beautiful  time  in  Freiburg  and 
the  Black  Forest  area  and  hope  that  we  will  have  many  stimulating  discussions. 


Rolf  Drechsler 

Symposium  Chair 


Bernd  Becker 

Symposium  Co-Chair 


We  are  pleased  to  extend  our  welcome  to  the  1999  International  Symposium  on 
Multiple-Valued  Logic.  It  has  been  our  pleasure  to  have  coordinated  the 
selection  of  a  technical  program  which  we  are  confident  you  will  find 
interesting  and  stimulating.  This  year  there  are  42  contributed  papers  presented  by 
MVL  researchers  from  around  the  world.  There  are  sessions  on  algebra, 
applications,  clones,  circuits,  decision  diagrams,  decomposition,  fuzzy  logic,  logic, 
logic  design,  and  testing.  This  year’s  program  is  thus,  as  in  the  past,  representative 
of  the  diverse  interest  in  multiple-valued  logic  research  across  many  disciplines. 

This  year’s  program  includes  two  invited  addresses.  Dr.  Toshio  Baba,  NEC  Japan 
will  address  Development  of  Quantum  Functional  Devices  for  Multiple-Valued 
Logic.  Prof.  Dr.  David  Basin,  University  of  Freiburg,  will  speak  on  Structural  and 
Behavioral  Modeling  with  Monadic  Logics.  We  thank  each  of  the  invited  speakers 
for  agreeing  to  address  the  symposium  and  for  their  written  contributions  which 
appear  in  these  proceedings. 

It  is  fitting  as  we  rapidly  approach  the  transition  to  the  year  2000,  that  this  year’s 
symposium  will  have  a  panel  session  [  on  Multiple-Valued  Logic  in  the  Next 
Millenium:  Challenges  and  Perspectives.  We  look  forward  to  the  views  of  the 
panelists  and  indeed  all  participants  at  ISMVJ^9.  1 

For  each  ISMVL,  contributed  papers  go  through  a  rigorous  reviewing  process  based 
on  full  paper  submissions.  We  tliank  all  authors  who  contributed  their  work  for 
consideration  for  presentation  at  this  year’s  symposium.  On  our  own  behalf,  and  on 
behalf  of  all  the  authors,  we  also  thank  the  referees,  listed  elsewhere  in  these 
proceedings,  who  contributed  their  time  and  expertise  to  the  selection  process. 
Their  efforts  are  crucial  to  the  success  of  a  meeting  such  as  ISMVL. 

On  behalf  of  all  attendees,  we  wish  to  thank  Dr.  Rolf  Drechsler,  Symposium  Chair 
and  Prof.  Bernd  Becker,  Symposium  Co-Chair  for  their  extensive  efforts  in  the 
organization  of  this  year’s  ISMVL.  We  also  thank  Nicole  Drechsler,  Christoph 
Scholl  and  Frank  Schmiedle  for  their  contributions  to  the  Symposium.  And  finally 
we  thank  the  staff  at  the  IEEE  Computer  Society,  for  their  efforts  in  preparing  these 
proceedings. 

We  hope  that  you  truly  enjoy  all  aspects  of  ISMVL’99  and  that  you  will  feel 
encouraged  to  participate  in  ISMVL’2000  and  future  symposia. 

Elena  Dubrova  (Europe-Africa) 

Takahiro  Hanyu  (Asia-Pacific) 

D.  Michael  Miller  (Americas) 

ISMVL-99  Program  Co-Chairs 
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Michael  Miller,  University  of  Victoria 

Asia/Pacific  Co-Chair 
Takahiro  Hanyu,  Tohoku  University 

Europe/Africa  Co-Chair 

Elena  Dubrova,  Royal  Institute  of  Technology,  Kista 


Session!.:  Takahiro  Hanyu 
Session2a:  Ivo  Rosenberg 
Session2b:  Yutaka  Hata 
SESSION3A:  Jon  Muzio 
Session3b:  Grant  Pogosyan 
Session4a:  Claudio  Moraga 
Session4b:  Reiner  Hahnle 
Session5:  Elena  Dubrova 


SESSION6A:  Rolf  Drechsler 
Session6b:  Michitaka  Kameyama 
Session7a:  Radomir  Stankovic 
Session7b:  Lucien  Haddad 
Session8:  Michael  Miller 
Session9a:  Tsutomu  Sasao 
Session9b:  Okihiko  Ishizuka 


Mostafa  Abd-El-Barr 
Seiki  Akama 
Takafumi  Aoki 
Tomoyuki  Araki 
Matthias  Baaz 
Hafiz  Md.  Hasan  Babu 
Bernhard  Beckert 
Robert  J.  Bignall 
Ferdinand  Borner 
Jon  T.  Butler 
K.  W.  Current 
Debatosh  Debnath 
S.  Demri 
Rolf  Drechsler 
Elena  Dubrova 
Gerhard  W.  Dueck 
Daniel  Etiemble 
Bogdan  J.  Falkowski 
Craig  M.  Files 
Glen  Gulak 
Dilian  Gurov 
Lucien  Haddad 
Reiner  Hahnle 
Takahiro  Hanyu 
Yutaka  Hata 
Andreas  Herrfeld 
Takahiro  Hozumi 
Mou  Hu 

Okihiko  Ishizuka 
Tatiana  Kalganova 
Michitaka  Kameyama 
Naotake  Kamiura 
Hiroaki  Kikuchi 
Tadahiro  Kitahashi 
Eiji  Kobayashi 
Koji  Kotani 
Thomas  Lukasiewicz 
Luca  Macchiarulo 
Hajime  Machida 
Felip  Manya 
Masayuki  Matsumoto 


D.  Michael  Miller 
Shin-ichi  Minato 
Masahiro  Miyakawa 
Claudio  Moraga 
Masao  Mukaidono 
Noriaki  Muranaka 
Jon  C.  Muzio 
Yasunori  Nagata 
Akira  Nakamura 
Alioune  Ngom 
A.  Nozaki 

Manuel  Ojeda-Aciego 
Ewa  Orlowska 
Marek  Perkowski 
Grant  R.  Pogosyan 
Ivo  Rosenberg 
C.  Rozon 
Gernot  Salzer 
Tsutomu  Sasao 
Hiroshi  Sawada 
Micaela  Serra 
Charles  B.  Silio 
Dan  Simovici 
Katsuhiko  Shimabukuro 
Vlad  Petrovich  Shmerko 
Viorica  Sofronie-Stokkermans 
Radomir  S.  Stankovic 
Bernhard  von  Stengel 
Ivan  Stojmenovic 
Iwan  G.  Tabakow 
Noboru  Takagi 
K.  Tanno 
Hisayuki  Tatsumi 
Mitchell  Thornton 
Zvonko  Vranesic 
Shugang  Wei 
David  M.  Wessels 
Yasushi  Yuminaka 
Richard  Zach 
Zelko  Zilic 


xi 


Chair:  Takahiro  Hanyu 


Development  of  Quantum  Functional  Devices  for  Multiple-Valued  Logic  Circuits 


Toshio  Baba 

Fundamental  Research  Laboratories,  NEC  Corporation 
34,  Miyukigaoka,  Tsukuba,  Ibaraki  305-8501,  Japan 
baba@frl.cl.nec.co.jp 


Abstract 

Quantum  functional  devices  exhibiting  unique  current- 
voltage  characteristics  are  reported  for  the  application  of 
multiple-valued  logic  circuits.  Multiple  negative- 
differential-resistance  (NDR)  characteristics  in  drain 
current-voltage  characteristics  are  demonstrated  by 
using  multiple-junction  surface  tunnel  transistors  (MJ- 
STTs).  Some  multiple-valued  logic  gates  such  as  inverter 
and  literal  are  implemented  using  the  MJ-STTs. 
Oscillatory  characteristics  of  drain  current  under  gate 
modulation  are  shown  by  single  electron  transistors. 
Nonvolatile  multiple-valued  memory  devices  utilizing 
these  unique  characteristics  are  described,  and  the 
fundamental  operation  of  write  and  read  of  stored 
electrons  are  demonstrated. 


1.  Introduction 

The  operation  speed  and  memory  density  of  ultra  large- 
scale  integrated  circuits  (ULSI)  is  increasing  dramatically 
because  the  device  size  has  been  decreased  according  to 
the  scaling  rule.  Recently,  however,  severe  problems 
with  regard  to  complex  wiring,  large  propagation  delay 
time,  and  high  power  consumption  are  being  discussed  in 
order  to  improve  the  performance  of  ULSIs.  Even  for 
MOSFETs,  which  is  key  component  in  ULSIs, 
enhancement  of  speed  through  size-reduction  tends  to  be 
restricted  because  of  an  increase  in  short  channel  effects 
and  gate  leakage  current.  Therefore,  any  new  technology 
from  circuit  architecture  to  transistor  is  strongly  desired  to 
overcome  these  problems  and  to  improve  the  performance 
of  ULSIs  in  the  future. 

The  quantum  functional  devices  (QFD)  project  in  Japan 
began  in  1991  in  order  to  develop  new  devices  that  utilize 
quantum  effects  for  overcoming  the  problems  on  future 
ULSIs  [1].  This  project  lasts  ten  years  and  is  divided  into 
three  terms  (Fig.  1).  Beginning  with  Phase  I,  which 


consists  of  the  development  from  basic  technology  for  new 
devices,  to  Phase  III,  which  consists  of  integrated-circuit 
technology,  devices  and  circuits  that  are  promising  for 
future  ULSIs  are  being  developed  at  a  level  close  to 
practical  use.  New  quantum  functional  devices  are 
proposed  by  the  following  eight  member  companies:  NEC, 
Fujitsu,  Motorola,  Matsushita  Electric,  Sony,  Hitachi, 
Hitachi  Europe,  and  Toshiba.  These  devices  have  unique 
current-voltage  characteristics  compared  with  the 
conventional  MOSFETs  and  bipolar  transistors.  These 
devices  are  thought  to  be  very  useful  for  binary  logic 
and/or  memories  because  they  can  reduce  the  number  of 
components  and  wires,  and  they  can  lower  the  power 
consumption.  In  addition,  some  devices  could  be 
applicable  to  multiple-valued  logic  circuits,  which  may 
further  reduce  the  circuit  components  than  those  for  the 
binary  circuits. 

Negative-differential-resistance  (NDR)  characteristics, 
which  appear  in  most  of  the  quantum  functional  devices, 
are  key  features  that  reduce  the  circuit  components. 
Several  new  binary  logic  circuits  have  been  constructed 
using  resonant  tunneling  transistors  (RTT),  which  include 
resonant  tunneling  diode  (RTD)  structures  [2,  3]. 
Furthermore,  multiple  NDR  characteristics  are  very 
promising  for  the  application  of  multiple-valued  logic 
circuits  because  of  clear  multiple  threshold  characteristics. 
These  characteristics  were  also  developed  by  series 
connections  of  RTDs  or  RTTs  [4,  5]. 

We  proposed  the  surface  tunnel  transistor  (STT)  as  one 
of  the  quantum  functional  devices  because  it  exhibits  the 
NDR  characteristics  in  the  drain  current-voltage 
characteristics  mentioned  under  the  QFD  project  [6].  Since 
this  device  has  a  planar  structure  similar  to  the  MOSFET, 
integration  of  devices  is  much  easier  to  the  vertical 
structure  devices  like  RTDs.  Because  of  this  structural 
feature,  multiple  NDR  characteristics  are  demonstrated 
using  a  multiple-junction  surface  tunnel  transistor  (MJ- 
STT),  which  consists  of  series  connected  tunnel-junctions 
between  a  source  and  a  drain  [7, 8]. 

Multiple  NDR  characteristics  are  also  seen  in  the  drain 
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Figure  1.  Quantum  functional  device  (QFD) 
project's  long  term  research  plan. 


current  as  a  function  of  the  gate  voltage  for  single  electron 
transistors  (SETs)  [9].  This  unique  feature  is  also  attractive 
for  the  multiple-valued  logic  circuits.  Because  the  SETs 
are  inherently  very  tiny  structures  with  nano-meter  scale 
and  because  they  can  operate  with  very  low  power 
consumption,  very  large  scale  integration  are  expected. 

In  this  paper,  some  multiple-valued  logic  and  memory 
circuits  using  MJ-STTs  and  SETs  are  proposed  from  NEC 
as  examples  of  the  application  for  multiple-valued  logic 
circuits  using  the  QFDs.  Prospects  of  multiple-valued  logic 
circuits  using  the  QFDs  are  also  discussed. 

2.  Multiple-valued  logic  circuits  using  a 
multiple-junction  surface  tunnel  transistor 

Figure  2  shows  a  schematic  cross-section  of  the 
multiple-junction  surface  tunnel  transistor  (MJ-STT)  and 
its  expected  transistor  characteristics  with  corresponding 
band  diagrams.  In  this  structure,  the  isolated  p+  regions  (p+ 
islands)  are  inserted  in  the  n+  channel  between  the  p+ 
source  and  the  p*  drain.  Many  p+/n+  interband  tunnel 
junctions  connected  in  series  are  formed  so  that  they 
correspond  to  the  number  of  p+  islands.  Because  each 
forward-biased  p+/n+  junction  generates  NDR 
characteristics,  multiple  NDR  characteristics  are  expected 
for  MJ-STTs.. 

InGaAs-based  MJ-STTs  were  fabricated  using 
molecular-beam-epitaxy  (MBE)  regrowth  [7].  After  the 
formation  of  the  isolated  p+-region,  the  n+  -channel  and 
gate  layers  are  deposited  on  this  structure  by  MBE 
regrowth.  A  quasi  planar  device  structure  is  formed  using 
this  fabrication  process.  Details  of  the  fabrication  process 
and  the  parameters  of  the  devices  are  written  in  reference  8. 

An  example  of  the  transistor  characteristics  for  MJ- 
STTs  is  shown  in  Fig.  3.  Transistor  characteristics  with  six 
NDRs  were  clearly  observed  for  the  fabricated  MJ-STT 
with  6  forward-biased  tunnel  junctions.  The  exact  relation 
between  the  numbers  of  tunnel  junctions  and  NDRs  were 
confirmed  by  fabricating  the  MJ-STTs  with  different 


Figure  2.  Expected  transistor  characteristics 
of  the  multiple-junction  surface  tunnel 
transistor  (MJ-STT).  Insets  show  the 
schematic  of  an  ideal  MJ-STT  structure  and 
the  band  diagrams  between  the  source  and 
the  drain. 
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Figure  3.  Transistor  characteristics  of  a 
fabricated  InGaAs  MJ-STT  with  6  forward- 
biased  tunnel  junctions.  The  broken-line 
curve  represents  the  reverse  scan  at  VG  =  0V. 
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numbers  of  tunnel  junctions. 

Two  types  of  fundamental  multiple-valued  logic  circuits 
were  designed  for  MJ-STTs  and  they  were  implemented 
monolithically  using  the  InGaAs-based  MJ-STTs. 

Figure  4(a)  is  a  schematic  cross-section,  4(b)  is  a 
microscope  photograph,  and  4(c)  is  the  equivalent  circuit 


(C) 

Figure  4.  Three-valued  inverter  element:  (a) 
Schematic  cross-section,  (b)  microscope 
photograph  and  (c)  equivalent  circuit. 


for  a  fabricated  three-valued  inverter  circuit.  This  circuit 
consists  of  series-connected  one-p+-island  MJ-STTs.  The 
operating  principle  of  the  fabricated  logic  circuits  is  based 
on  either  monostable-bistable  transition  logic  (MOBIL)  or 
monostable-multistable  transition  logic  (MML)  [10,  5]. 

Figure  5  shows  the  input  and  output  traces  of  the  three¬ 
valued  inverter  circuit  depicted  in  Fig.  4.  Based  on  the 
results  of  measurements  for  the  fabricated  device,  we 
chose  a  clocked  supply  voltage  of  1.75  V.  The  input 
voltages  were  —1.4  V  for  the  “0”  logic  level,  -0.6  V  for  the 
“1”  logic  level,  and  0.2  V  for  the  “2”  logic  level.  The 
operation  of  the  three-valued  inverter  can  be  clearly  seen 
in  Fig.  5.  Furthermore,  it  is  found  that  the  output  is  kept 
even  after  the  input  pulse  returns  to  zero.  This  indicates 
that  this  inverter  circuit  has  a  latch  function. 

Figure  6(a)  is  a  schematic  cross-section,  6(b)  is  a 
microscope  photograph,  and  6(c)  is  the  equivalent  circuit 
of  a  fabricated  literal  circuit.  This  circuit  consists  of  series- 
connected  zero-p+-island  MJ-STTs  and  parallel-connected 
zero-p+-island  MJ-STTs.  The  operation  of  this  circuit  is 
also  based  on  the  MML  using  clocked  supply  voltage.  To 
explain  the  operation  principle  of  this  literal  circuit,  which 
was  originally  proposed  by  Waho  using  RTDs,  Fig.  7 
shows  the  peak  current  for  each  MJ-STT  in  this  circuit  as  a 
function  of  the  input  voltage  [5].  Each  MJ-STT  is  labeled 
to  "XT’,  "X2",  "X3M,  and  "X4".  Due  to  the  feature  of  the 
MML  or  series  connected  NDR  elements,  a  device  having 
the  lowest  peak  current  goes  to  a  high  voltage  state  first 
when  the  supply  voltage  is  raised  from  0  V  .  Based  on  this 
principle,  the  output  voltage  increases  only  when  the  input 
voltage  is  between  VA  and  VB,  where  the  sum  of  the  peak 
current  for  X3  and  X4  is  lower  then  those  for  XI  and  X2. 
Therefore,  literal  operation  can  be  achieved  in  this  circuit. 


Figure  5.  Input  and  output  traces  of  the 
three-valued  inverter  circuit. 
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Figure  6.  Literal  circuit:  (a)  schematic 
cross-section,  (b)  microscope  photograph, 
and  (c)  equivalent. 


INPUT  VOLTAGE 

Figure  7.  Relation  of  peak  current  for  MJ- 
STTs  in  the  proposed  literal  circuit. 


Figure  8.  Input  and  output  traces  of  the 
literal  circuit. 


The  fabricated  literal  circuit  displayed  in  Fig.  6  was 
designed  to  satisfy  the  relation  shown  in  Fig.  7.  To  control 
the  initial  peak  current  and  the  increased  peak  current 
caused  by  the  input  voltage,  the  driver  MJ-STT  is  split  into 
two  parts,  that  is,  the  parallel  connection  of  X3  and  X4. 
Figure  8  shows  the  input  and  output  traces  of  the  literal 
circuit  depicted  in  Fig.  6.  We  chose  a  clocked  supply 
voltage  of  1.2  V.  The  input  voltage  was  0  V  for  the  “0” 
logic  level,  0.4  V  for  the  “1”  logic  level,  and  0.8  V  for  the 
“2”  logic  level.  The  literal  function  of  xl\  which  gives  a 
high  output  level  only  for  "1",  is  clearly  demonstrated  in 
Fig.  8.  We  also  investigated  this  circuit  by  changing  the 
bias  voltage  of  VBi  and  Vb4,  and  obtained  the  literal 
function  of  not  only  x1 1  but  also  x00  and  x22. 
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3.  Multiple-valued  memory  using  single 
electron  transistor 

Using  oscillatory  characteristics  of  single  electron 
transistors  (SETs),  two  types  of  multiple-valued  memories 
were  developed.  Figure  9  shows  one  of  the  proposed 
memories  (Si  in-plane  floating-dot  memory).  This  device 
consists  of  a  source,  drain,  SET  island,  floating  dot, 
control  gate,  and  back  gate.  Two  artificial  narrow 
constrictions  between  the  SET  island  and  source/drain  act 
as  tunneling  barriers.  The  floating  dot  stores  electrons 
injected  from  the  control  gate  and  acts  as  a  memory  node. 
This  structure  was  fabricated  using  a  20-nm-thick  highly- 
doped  n-type  Si  film  on  a  SIMOX  (separation  by 
implanted  oxygen)  wafer.  To  pattern  a  10-nm  scale  fine- 
structure,  we  used  calixarene,  a  high-resolution  negative 
resist  under  electron-beam  lithography,  and  reactive  ion 
etching  with  CF4  gas.  The  Si  SET  island  was  30  nm  in 
diameter  and  the  floating  dot  was  15  nm  in  diameter.  The 
Si  surface  is  depleted  and  the  narrow  constriction  between 
the  island  and  source  (or  drain)  nearly  pinched  off.  The 
constriction  thus  acts  as  a  tunneling  barrier. 

Electrical  device  characteristics  were  measured  in  a  4He 
continuous-flow  cryostat.  The  device  was  voltage-biased 
symmetrically  between  the  source  and  drain  at  Vsd  =  1  naV . 
A  clear  periodical  current  oscillation  with  back-gate  and 
control-gate  voltages  (VB  and  Vc)  was  observed  (Fig.  10), 
and  the  formation  of  a  high  quality  SET  was  indicated. 
These  oscillations  were  observed  up  to  60  K.  Within  each 
oscillation  period,  increment/decrement  of  one  electron  in 
the  SET  island  occurred  by  means  of  capacitive  coupling 
with  the  gates.  Based  on  these  results,  we  can  compensate 
the  influence  of  gate  voltages  to  the  SET  island  by  keeping 
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Figure  9.  Scanning  microscope  photograph 
of  a  Si  in-plane  floating-dot  memory. 


the  bias  conditions  of  VB  =  -0.54  Vc  even  during  the  gate 
sweep  (shown  in  the  upper  curve  of  Fig.  10)  [11].  However, 
when  |VC|  or  |VCbI  exceeded  a  certain  threshold  voltage,  the 
current  started  to  oscillate  due  to  the  electron  injection 
from  die  control  gate  into  the  floating  dot  node  by  Fowler- 
Nordheim  (FN)  emission.  These  electrons  change  the 


Figure  10.  Current  versus  control-gate 
(back-gate)  voltage:  VB  scan  while  control 
gate  is  grounded,  Vc  scan  while  back  gate 
is  grounded,  and  control  and  back  gates 
scan  with  VB=-0.54VC. 


Figure  11.  Current  when  Vc  =  -5  -» -8  (A)  — >  - 
5  (B)  -8  (C)  ->  -2  (D)  -5  V  (E)  with  VB  =  - 

0.54  Vc.  Each  curve  is  shifted  upward  by  0.1 
nA  for  clarity. 
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potential  of  the  floating  dot  and  also  modify  the  charges  on 
the  SET  island. 

To  demonstrate  the  memory  function,  we  alternated  the 
charged  and  discharged  of  the  floating  dot  by  sweeping  the 
control  and  back  gate  voltages  and  kept  the  above 
condition  (Fig.  1 1).  Charging  and  discharging  were  seen  as 
current  oscillations  at  curves  A  and  D.  Once  the  floating 
dot  was  charged  (curve  A),  a  subsequent  sweep  of  Vc  in 
the  same  direction  did  not  change  the  dot  charge,  and  the 
current  showed  no  oscillation  (curve  C).  When  Vc  >  -5  V, 
discharge  occurred  (curve  D).  This  shows  the  memory 
function.  If  we  could  count  the  number  of  SET  oscillations 
during  charging  and  discharging  of  the  floating  dot  by  a 
count  circuit,  data  storage  could  be  multiple-valued.  Since 
this  type  of  reading  is  not  disturbed  by  background  charge, 
a  reliable  memory  operation  is  also  expected  in  the 
floating-dot  memory  devices  with  SET. 

We  have  developed  different  types  of  multiple-valued 
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Figure  12.  Schematic  view  of  the  SiN  SET 
memory  device. 
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Figure  13.  Write  and  erase  operation  of  the 
SiN  SET  memory  device. 


memory  device.  Figure  12  shows  a  schematic  view  of  the 
SiN  SET  memory  device.  The  operation  principle  of  this 
device  is  almost  the  same  as  the  SET  memory  described 
above.  Electrons  are  stored  in  the  electron  traps  of  the  SiN 
films  that  are  embedded  in  the  gate  insulator  of  the 
MOSFET  instead  of  the  floating  node.  Sensing  the  stored 
electrons  are  carried  out  by  an  A1  based  SET  formed  at  the 
position  of  the  gate  electrode  [11]. 

Similar  results  were  obtained  with  the  planar  SET 
memory,  which  indicates  a  proper  operation  of  multiple¬ 
valued  memory.  In  addition,  nonvolatile  characteristics 
were  demonstrated  as  shown  in  Fig.  13.  A  retention  time  of 
about  one  hour  was  measured.  This  retention  time  is 
shorter  than  the  expected  value  for  the  deep  electron  traps 
in  the  SiN  films.  By  improving  the  fabrication  process  and 
the  structural  design,  a  longer  retention  time  could  be 
obtained. 

4.  Discussion 

We  described  multiple-valued  logic  circuits  using  MJ- 
STTs  and  multiple-valued  memory  devices  using  SETs. 
These  are  the  limited  examples  of  the  circuit  application 
for  multiple-valued  logic  using  QFDs.  Most  of  the  QFDs 
have  negative-differential-resistance  characteristics  and  are 
considered  very  useful  for  implementing  multiple-valued 
logic  circuits.  Multiple  NDR  characteristics  are  easily 
demonstrated  by  making  the  series  connection  of  devices 
have  single  NDR  characteristics.  Using  this  approach, 
several  multiple-valued  circuits  have  been  demonstrated 
using  resonant-tunneling  diodes.  This  approach  is  also 
applicable  for  other  QFDs  having  NDR,  interband 
tunneling  diode,  and  resonant  interband  tunneling  diode. 
MJ-STT  is  an  example  of  a  new  device  structure  that  is 
constructed  with  STT  by  this  approach.  Therefore,  other 
QFDs  also  have  a  potential  to  implement  the  multiple¬ 
valued  logic  circuits. 

To  implement  the  multiple- valued  logic  circuits  using 
QFDs,  the  development  of  design  tools  is  very  important. 
Since  the  current-voltage  characteristics  and  materials  are 
different  from  the  conventional  transistor,  the  design  or 
simulation  tools  are  not  considered  to  be  directly 
applicable  to  QFDs.  A  device  simulator  handling  the 
quantum  phenomena  and  circuit  simulator,  including  the 
proper  device  model  for  QFDs,  should  be  developed.  The 
implementation  of  internal  models  of  QFDs  in  the  circuit 
simulator  is  also  desired  to  speed  up  the  circuit  simulation. 

The  most  important  issue  for  practical  usage  of  the 
QFDs  is  thought  to  be  the  development  of  new 
combination  circuits  or  architecture  for  the  QFDs  utilizing 
the  features  of  their  current-voltage  characteristics. 
However,  since  device  researchers  are  not  familiar  with  the 
system  design  and  since  system  researchers  are  bit  always 
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aware  of  new  devices,  discussions  about  new  devices  and 
new  circuits  between  both  groups  are  indispensable  for 
paving  the  way  for  developing  multiple-valued  logic 
circuits  using  QFDs. 

5,  Conclusion 

We  demonstrated  multiple  negative-differential- 
resistance  (NDR)  characteristics  in  drain  current-voltage 
characteristics  using  multiple-junction  surface  tunnel 
transistors  (MJ-STTs).  Multiple-valued  logic  gates  of 
ternary  inverter  and  literal  circuits  are  implemented  using 
the  MJ-STTs.  Oscillatory  characteristics  of  drain  current 
under  gate  modulation  are  showed  by  single  electron 
transistors.  Nonvolatile  multiple-valued  memory  devices 
utilizing  these  characteristics  to  store  electrons  in  floating 
nodes  or  electron  traps  are  described  and  the  fundamental 
operation  of  write  and  read  of  stored  electrons  are 
demonstrated. 
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ABSTRACT  :  This  paper  deals  with  a  multivalued 
extension  of  the  concept  of  a  binary  relation  on  a  set  E.  If 

R  is  such  an  r -valued  relation  on  E,  for  every  (x,  y)  G 

Z?2,  there  exists  exactly  one  iG  {0,  1 ,  ...,  r-1}  such  that 
the  degree  of  comparability  of  (x,  y)  with  respect  to  R  is 
equal  to  i.  The  set  of  all  r -valued  relations  on  E  is  then 
equipped  with  an  order  relation  <  and  turns  out  to  be  a 
Post  algebra  of  order  r.  This  first  part  of  the  paper  makes 
a  large  use  of  the  results  of  a  previous  paper  (1999  a).  In 
a  second  step ,  we  study  multivalued  product  of  relations, 
as  well  as  multivalued  reflexivity ,  symmetry , 
antisymmetry,  and  transitivity.  It  follows  for  example 
that,  from  a  multivalued  viewpoint ,  symmetry  and 
antisymmetry  turn  out  to  be  quite  different ,  since  the 
latter  is  hereditary  while  the  former  is  not.  We  finally 
investigate  some  properties  of  multivalued  equivalences , 
multivalued  orders  and  preorders. 

1.  Preliminaries. 


relations  considered  in  this  paper  will  be  binary,  from 
now  we  shall  omit  this  term  and  simply  speak  of 
“relations”.  Recall  now  that  every  powerset  P(Q)  is 

isomorphic  to  <2>^  via  the  boolean  characteristic 
function  theorem  (<2>  =  {0,1}).  We  will  apply  first  this 
result  to  Q  =  E2,  so  that  the  set  P(E2)  of  all  relations 
on  E  is  boolean-isomorphic  to  <  2  >(  E  \  In  fact  to  every 
relation  R  on  E,  we  associate  the  function  JC(R)  from  E2 
to  <2>  (also  denoted  R)  by  : 

(V  (x,  y)  E  E2))  R  (x,  y)  =  1  if  and  only  if  (x,  y)  E 
R  (consequently :  R  (x,  y)  =  0  if  and  only  if  (x,  y)  £  R). 
This  theorem  expresses  this  very  simple  fact :  if  a  R  is  a 
given  relation  on  E,  and  (x,  y)  any  element  in  E2,  only 
two  cases  may  happen  :  either  (x,  y)  belongs  to  R,  either 
it  does  not.  Consequently,  from  a  logical  viewpoint,  the 
concept  of  “relation  “  is  here  a  two-valued  one.  The  aim 
of  this  paper  is  to  extend  this  concept  to  the  multivalued 
case. 


§  1.  1  Center  of  a  bounded  distributive 
lattice. 

We  will  use  the  following  notations  :  if  one  has  x 
<  y  in  a  poset,  we  will  say  that  y  dominates  x. 
Supremum  and  infimum  of  two  elements  x  and  y  of  a 
lattice  T  will  be  denoted  by  x  v  y  and  x  a  y.  If  it  exists, 
the  supremum  (resp.  infimum)  of  any  indexed  family  x  i 
(i  E  I)  of  elements  of  T  will  be  denoted  by 
V  Xi  (  resp.  A  Xi )  .  Universal  elements  (if  they  exist) 
iei 

will  be  denoted  by  0  and  1  ;  we  shall  also  say  that  T  is  a 
(0,1)  -  lattice.  Recall  that  in  any  (0,1)  -  distributive 
lattice,  the  set  of  the  complemented  elements  is  a 
Boolean  algebra,  called  the  center  of  T.  The  complement 
of  any  element  a  of  a  boolean  algebra  will  be  denoted  by 
a  . 

§  1.  2  Binary  relations. 

Recall  that  a  binary  relation  R  on  a  set  E  is 
simply  a  subset  of  E2,  so  that  the  set  of  all  the  binary 
relations  on  E  is  the  powerset  P(E2),  which  is  usually 
ordered  by  inclusion :  R  C  S  if  (x,  y)  E  R  =>  (x,  y)  E 
S.  Moreover  (P(E2)  ;  U  ,  H  ,  ,  0,  E2)  is  a  boolean 
algebra,  so  that  we  can  define  join,  meet,  and 
complementation  for  binary  relations  : 
R  U  S  ;  R  PI  S  ;  R.  The  relation  R  is  called  the 
complement  of  the  relation  R  :  (x,  y)  E  R  if  and  only  if 
(x,  y)  £  R.  Recall  also  the  diagonal  A  of  E2  is  the 
binary  relation  :  A  =  {  (x,  x)  ;  x  E  E  }.  Since  all 


§  1.  3  Subdiagonals. 

Let  E  be  any  set  and  kEIN*(i.e  any  non-zero 
natural  integer).  Define  the  subdiagonal  of  order  k  of 
P(E2)  as  the  set  of  all  sequences  of  k  relations  on  E, 
subject  to  the  decreasing  chain  condition,  that  is  : 

k(E2)  =  {R  =  (R<i»  ....  R<k>)e[P  (E 2)]k/ 
R<j>  Z»R<j+i  >  ;1  i  j  <  k-1  } 

It  is  easily  proved  ([6],  Theorem  1)  that,  equipped 
with  the  product  order  (componentwise),  k(E2)  is  a 
complete  bounded  distributive  lattice,  and  also  that  an 
element  R  of  k(E2)  is  complemented  if  and  only  if  all  its 
components  R<  k  >  are  equal,  so  that  the  center  AR(E2) 
of  k(E2)  is  boolean  isomorphic  to  the  set  P(E2)  of  all 
relations  on  E,  under  the  map  P(E2)  -*  Afc(E2)  :  R  -> 
"R"  =  (R,  R,  ...,  R). 

§  1.  4  Post  algebras . 

We  recall  now  the  Epstein-Traczyk’s  definition  (see  [2] 
and  [8])  of  a  Post  algebra  :  given  any  integer  r  >  2,  a 
Post  algebra  of  order  r  is  a  bounded  distributive  lattice  P, 
whose  center  is  denoted  by  B,  such  that : 

1.  There  exists  in  P  a  finite  chain  <  r  >  with  r 

elements  :<r>  =  {0  =  eo<ei  < . <er _  i  =  1  }. 

2.  To  every  element  x  of  P,  one  can  associate  a 
family  (x  <*  o  ^  i  ^  r-1  °f  r  elements  of  B  subject  to 
“orthonormality”  conditions  (1.4. 1): 
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r-l 

V  x<!>=1  and  x<i>Ax<-i>  =  0  (for all  i  ^  j) 

i  =  0 

r-l 

such  that :  x=\/x<,>Aei  (1 .4.2) 

i  =  0 

3.  The  representation  (1.4.2)  is  unique  in  the 
following  sense  :  if  x  happens  to  admit  the  representation 

r-  1 

x  =  V  a'A  ei  where  a0,  a1,..,  ar_1  E  B,  are 

i  =  0 

subject  to  orthonormality  conditions  (1.4.1),  then 
x  <i>  =  a1  for  every  i  ♦ 

It  is  easily  shown  (e.g  [1])  that,  if  P  is  a  Post 
algebra  of  order  r,  then  the  integer  r  is  unique,  as  well  as 
the  chain  <  r  >  and,  for  every  x,  the  disjunctive 
components  x  <  1  >  ‘s.  One  can  also  prove  that  the 
subdiagonal  (r-l)(E2)  (cf  §1.3  above)  is  a  simple 
example  of  complete  Post  algebra  of  order  r  (see[6j). 

2.  Multivalued  relations. Quasi¬ 
relations. 


value  :  (p,  0,  ...,0,  p)  =  1  p  *  for  some  (common) 
relation  p  on  E.  ('  p  1  to  be  read  as  «quasi-p»).  '  p  1  is 
called  a  quasi-common  multivalued  relation  on  E  (of 
valuation  r),  and,  for  short,  a  quasi-relation  on  E.  It  is 
clear  that,  if  R  ='  p  ’,  then  for  every  (x,  y)  in  E2  the 
degree  of  comparability  of  x  to  y  belongs  to  the  pair  {0, 
r-l}  ;  a  multivalued  relation  which  is  not  a  quasi -relation 
is  said  actual.  A  multivalued  relation  R  such  that  R<  0  > 
=  0  =  R<  r "  1  >  is  said  strictly  actual  (its  non  empty 
components  are  all  intermediary  ones)  ♦ 

Note  that,  for  r  =  2,  P2(E2)  =  {(R,  R)  ;  R  e 
P(E2)}.  It  follows  that  the  power  set  P(E2)  of  all 
common  relations  on  E  is  isomorphic  -under  the  map  : 
R-*  (R,  R)-  to  the  lattice  P2(E2)  °f  mvrs  on  E  of 
valuation  2. 

3.  Post  algebra  structure  on  Pj(E2). 

§  3.  1  Lattice  structure  on  the  set  of 
multivalued  relations . 


In  sections  2-5  below  we  shall  use  of  some  of  the 
results  of  [6],  where  we  proved  many  theorems 
concerning  Pr(Q),  that  is  the  lattice  of  all  r-ordered 
partitions  of  some  set  Q.  We  will  recall  below  without 
proofs  some  of  these  results,  and  apply  them  to  the  case 
Q  =  E2,  that  is,  we  shall  translate  them  in  a  “relational” 
context. 

Definition  2.1  ([6],  Definition  2.1)  Let  E  be  any  set 
and  r  E  IN*,  r  >  2.  We  call  a  multivalued  relation  of 
valuation  r  on  E ,  (briefly  mvr),  every  family  R  = 
(R<0>,  ...,  R<r  “ 1  >)  of  r  relations  on  E  such  that : 

U  R <l>  =  E2  and  R<1>  fl  R<j>=  0 

0  si  sr-  1 

for  all  i  *j. 

In  other  words,  a  mvr  of  valuation  r  is  simply  a 
partition  of  the  set  E2  by  the  means  of  r  of  its  subsets, 
what  we  called  in  [6]  an  r -ordered partition  of  E2 
Therefore,  for  every  ordered  pair(x,  y)  in  E2,  there  exists 
exactly  one  i  =  0,  1, r-l  such  that  (x,y)eR<i>;i  (= 
iR(x,  y))  is  the  degree  of  comparability  of  x  to  y  with 
respect  to  R.  The  R<  1  >  will  be  called  the  disjunctive 
components  of  R  (there  are  r  such  components). 

If  R  is  a  mvr  on  E  of  valuation  r,  we  will  also  say 
that  R  is  an  r -valued  relation  on  E.  For  the  sake  of  a 
proper  distinction  between  the  multivalued  case  and  the 
previous  one,  it  will  be  convenient  from  now  on  to  call 
“common  relations”  on  E  what  we  had  simply  called 
“relations”  that  is,  elements  of  P(E2).  Note  that, 
whatever  relations  may  be,  either  multivalued  or 
common,  they  will  remain  “binary”  anyway,  so  that  we 
will  go  on  omitting  this  term.  An  important  special  case 
happens  when  all  intermediary  relations  (R<  * >  ;  1  <  i  ^ 
r  -  2)  are  empty.  It  follows  that  such  a  mvr  must  have  the 


Let  R  =(R  <1:>)  o  <  i  s  r  -  1  and  S  =  (S  <*  >)  o  s  i  s  r-l 

be  any  two  mvrs  on  E.  Define  R  <  S  by  : 

(V  i6{0,  1 . r-l})  R<!>  C  u  s<j> 

j  =  » 

then,  one  can  prove  ([6],  Theorem  2.1)  that  <  is  an  order 
relation  on  Pj-(E2).  Hence,  according  to  this  definition,  a 
multivalued  relation  R  precedes  S  if  and  only  if,  for  every 
(x,  y)  E  E2,  the  degree  of  comparability  of  (x,  y)  w.r.t  S 
is  at  least  the  the  degree  of  comparability  of  (x,  y)  w.r.t 
R.  We  then  proved  ([6],  Theorem  2.3)  that  P,{E2), 
equipped  with  this  order  relation  is  a  complete  bounded 
distributive  lattice:  for  every  pair  R  = 

(R^)  0<i^r-landS  =  (S<i>)0<i^r-lof 
elements  of  Pi<E2),  one  has: 


[Rvs]<i>=  R<i>n  U  s <j>  u 

j  =  0 

[RAs]<i:>=  R<l>n  U  s<j>  u 

for  all  i  =  0,1,...,  r-l. 

In  other  words,  an  element  (x,  y)  of  E2  has  its 


s<i>nU  r <J> 

j  =  0 

s<l>n  U  r<j> 

j  =  i 


degree  of  comparability  w.r.t  R  V  S  equal  to  i  if  and 
only  if  alternatively,  either  it  has  degree  i  w.r.t  R  and 
degree  at  most  equal  to  i  w.r.t  S,  either  it  has  degree  i 
w.r.t  R  and  degree  at  most  equal  to  i  w.r.t  S.  We  also 
proved  that  a  multivalued  relation  is  complemented  in  the 
lattice  Pj<E2)  if  and  only  if  it  is  a  quasi -relation.  The 
center  A|{E2)  of  Pj(E2)  is  boolean-isomorphic  to  P(E2) 
via  the  isomorphism  :  p  ?p\  So,  we  will  be  able  to 
identify,  in  the  lattice-theoretic  sense,  a  common  relation 
p  and  the  quasi -relation  'p  '  ;  when  restricted  to  quasi¬ 
relations,  sup  and  inf  so  coincide,  up  to  isomorphism, 
with  set-theoretic  meet  and  join  for  (common)  relations. 
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Lemma  :  From  the  formulas  above,  we  get : 

(R  V  S)  <0>  =  r<0>  n  s  <0> 

(R  aS)<0>  =  R<0>U  S<0> 

(R  V  S)  <r'1  >  =  R  <r'1  >  U  S<r-}  > 

(RAS)<r-1>  =  R<r-1>  f|S<r-1> 

§  3.  2  Fundamental  chain  ;  the  postian 
structure  for  the  lattice  of  multivalued 
relations. 

In  a  further  step  ([6],  th.2.7),  we  introduced  in 
Pi*(E^),  what  we  called  a  fundamental  chain  in  the 
following  way  :  (eo,  ej,  ...,  er  .  ])  is  the  r-tuple  of 
elements  of  [  P(E2)]^  defined  for  0  <  i  <  r  - 1 ,  by  : 

(e  j)  j  =  if  i  =  j  and  (e  j)  j  =  0  otherwise,  that  is  :  e  {  = 
(0,  0,  ...  0,  E^,  0,...,0).  Clearly,  every  e  {  is  a  mvr  (for 

every  (x,  y)  E  E  \  the  degree  of  comparability  of  x  to  y 
is  equal  to  i)  ;  moreover,  with  respect  to  the  order  of 
Pr(E^),  the  e  j  ‘s  form  a  chain  with  r  elements  : 

0  =  e  o  <  e  i  <  ...  <  e  r  .  j  =  1  ♦ 

On  the  other  hand,  recall  that  every  R<  k  >  *s  a 
common  relation,  so  that 4  R<  k  >  ‘  is  a  quasi  -relation. 
Then  one  can  prove  ([6],  th.  2.9)  the  first  fundamental 
representation  theorem: 

Theorem  1  (disjunctive  representations)  :  For  every 
multivalued  relation  R  of  valuation  r  : 

R  =  (R  <1  >)  o  <  i  <  r-i  e  Pr(E2),  one  has  : 

R  =  V  ,R<i>,Aei 

Osisr-I 

Moreover,  such  a  representation  is  unique  in 
the  following  sense  : 

If  one  has  R  =  V  '  X'  ’  a  e* 

Osisr-1 

where  the  'Xj' s  are  r  quasi -relations  such  that  : 
such  that  V  '  X  * '  =  1  (  =  (0,  E2)) 

Osisr-J 

and  '  X  1  '  a  '  X j '  =  0  (  =  (E2,  0,...,0  ) )  for  all 
i  *  j,  then  X*  =  R<  1  >  for  every  i  =  0,  1, ...,  r-1  ♦ 

From  Theorem  1,  it  follows  this  important  result 
([6],  corollary  2.8.1): 

Theorem  2  :  For  every  set  E  and  every  r^  2, 
Pr(E^)  is  a  complete  Post  algebra  of  order  r. 

It  follows  that  the  present  theory  is  actually  a  set- 
theoretic  realization  of  multivalued  logic  Post 
conceptions.  Since  Pr(E^)  is  complete,  it  follows  also 
that,  for  every  family  (S  j)  j  e  i  of  mvrs  on  E,  then 

V  Si  and  s»  exist, 
iel  iel 

§  3.  3  Monotonic  components  of  a 
multivalued  relation.  First  isomorphism 
theorem . 


In  a  further  step,  one  define  ([6],  Theorem  2.2, 
formulas  2.7),  for  every  mvr  R  =  (R<  1  >)  0  <  i  <  r-1  ^ 
Pr(E^),  the  sequence  of  its  (r-1)  monotonic  components 

r  -  1 

R  <i>  =  U  R<j>  for  i  =  I  ,  2  , ,  r  -  1. 
j  =  i 

Actually,  to  R  we  associate  D  (R)  = 

(R<  1  >,  R<2»  ....  R<r-1  >)  e  [ P  (E2)]r  -  1  by  : 

Then  one  can  prove  ([6],  Theorem  2.2) : 

Theorem  3  (First  fundamental  isomorphism  theorem) : 

D  is  a  lattice  isomorphism  from  (Pr(E^),  <) 

onto  ((r  -  1)(E^),  <),  the  inverse  mapping  D“^ 
being  defined  by  : 

R<0>  =  R<1>  ;  R<j>  =  R<j>  n  R<j+ 1>  for  Is  j  <  r-2 
R  <r‘1>  =  R<r-l>  ♦ 

Remarks  3.3.1  Note  that  (r  -  1)(E2)  is  the 

subdiagonal  of  order  (r-1)  of  E2,  as  defined  supra  in  §  1.3. 
Note  also  that  the  “one-to-one”  part  of  the  proof  is  usual 
in  measure  and  probability  theory  :  to  every  family  of  r 
pairwise  disjoint  relations  on  E,  we  associate,  by  a  one- 
to-one  correspondance,  a  decreasing  sequence  of  (r-1) 
relations.  For  instance,  for  r  =  3  and  R  =  (R<  ®  >,  R<  1  >, 

r<2>) 

,  one  has  : 

D(R)  =  (R<I>UR<2>,  R<2>)  =  (R<i>,R<2>) 
and  conversely,  with  R<  i  >  D  R<  2  >  ,  we  get : 

D  (  R<  1  >,  R<  2  >) =  (  R<  1  >  »  R<  1  >  n  R<  2  >  ,  R<  2  > ) 

=  (R<0>,R<1>,R<2>). 

Remarks  3.3.2  From  Theorem  3,  it  also 

follows  that  R  <  S  (resp.  R  =  S)  if  and  only  if  R<  k>  < 
S<k>  (resp.  R<k>=  S<k>)  for  k  =  1,  2,...,  r-1.  It  also 
follows  that  [R  v  S]  <k>=  R  <k>  v  S  <k>  and 
[R  a  S]  <k>  =  R  <k>  a  S  <k>  •  On  the  other  hand,  let 
D 1  be  the  restriction  of  D  to  the  respective  centers  of  the 
two  lattices,  Pj{E^)and  (r  -  1)(E^);  then  !Di  is  a 
boolean  isomorphism  of  these  centers.  We  then  get : 

D('  p  ')  =  "p"  (for  the  definition  of  "p”,  see  above  §1.3), 
for  every  common  relation  p  on  E.  It  follows  that  a  mvr 
R  is  a  quasi -relation  if  and  only  if  all  its  monotonic 
components  R<  k  >  are  equal. 

Theorem  4  (Monotonic  representations)  :  For  every 
multivalued  relation  R  E  Pr(E^),  one  has  : 

R  =  V  '  R  <i>  'a  ej 

1  sisr-  1 

Moreover,  such  a  representation  is  unique  in 
the  following  sense : 

If  one  has  R=  V  'Xj  'a  ej 

1  sisr-l 

where  the  ’X1*  s  are  (r-1)  quasi-relations  such 
that  1  X  i  4  £: 4  X  j  +  j  ‘for  every  i,  then  X  j  = 

R<  j  >  for  every  i  =  1,  2,...,  r-1. 
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4.  Negation  and  pseudocomplement  of  a 
multivalued  relation. 

§  4.1  Negation . 

We  also  proved  in  ([6],  Theorem  2.4)  that  we 
could  equip  Pr(E^)  with  an  involutive  lattice  duality  by 
associating  to  every  mvr  R  of  valuation  r : 

R  =  (R<  i  >)  0  <  i  <  r.i  e  P^E2),  the  r-tuple  R®  defined 
by  :  (R®)  <*>  =  R<r~  1  -  i>  for  0  <  i  <  r  -  1  (one 
has  inverted  the  order  of  the  components,  hence  the 
weight  of  the  comparabilities)  :  R®  will  be  called  the 
negation  of  R.  We  have,  for  every  R  and  S  : 

(R  v  S)®=R®  a  S®  ;  (R  a  S)®=R®  v  S®  ;  (R®)®=R. 
If  R  is  a  quasi-relation,  say  R  =  ‘  p  so  is  R®  =  ‘  p 

4.2  Pseudocomplements . 

We  also  proved  in  ([6],  Theorem  2.5)  that  Pj*(  E^) 
is  a  Heyting  algebra,  that  is,  to  say  that,  for  every  (R,  S) 
E  [Pr(E^)]2,  the  inequation  :  R  a  X  ^  S  admits  a 
greatest  solution  in  Pr(E^),  denoted  (S  I  R),  and  called 
inf-relative  pseudo-complement  of  S  to  R.  In  fact, 
formulas : 

(  S  I  R)<  j  >  =  (  S<i>  U  R<i>)  for  every  j 

1  *  i  s  j 

provide  the  monotonic  components  of  (S  I  R)  (cf.[5J). 
Dually,  we  proved  existence  and  unicity  for  the  sup- 
relative  pseudo-complement,  that  is,  the  least  solution  of 

RvX^S.  We  may  then  conclude,  as  in  ([6],  Theorem. 

2.6)  that,  for  every  R  in  Pr(E2),  the  equation  R  a  X  =  0 
admits  a  greatest  solution  R*,  which  is  simply  the  quasi¬ 
relation  :  R*  =  ‘  R  <  0  >  \  The  map  R  ->  R*  is  a  lattice 
duality  such  that  R  =  0  o  R*  =  1.  Hence 

(R  v  S)*=  R*  a  S*  ;  (R  a  S)*=  R*  v  S% 

Finally  note  that,  if  R  is  a  quasi-relation,  say  R  =  ‘  p  \ 
so  is  R*  =  1  p  Moreover,  for  every  mvr  R,  one  has  : 

(R®)  *  =  ®  _  *  0  >  1 

5.  Characteristic  functions.  Second 
isomorphism  theorem. 

Let  <  r  >  be  any  chain  with  r  elements  :  <  r  >  = 

(ao=  0<  ai<  ..<  ar .  i  =  1},  e.g  :  a  t  =  —  (0  s:  k  r-1). 

r-1 

Then,  for  every  mvr  R  =  (R< *  >)  0  <  i  <  r-1  G  Pr<E2), 

( E^ 

we  may  associate  3C(R)  E  <  r  >v  }  by  : 

( V(x,  y)  e  E2)  JC(R)  (x,  y)  =  a  j  (x,  y)  e  R<  J  >. 

We  proved  easily  ([6],  Th.  2.10)  that  JC(R)  is 
well  defined  on  E^  ;  JC(R)  is  the  (multivalued) 
characteristic  function  of  R  and  will  be  denoted  from  now 
R .  We  then  proved  : 


Theorem  5  (Second  fundamental  isomorphism  theorem) 

For  every  set  E  and  every  integer  r  £  2,  Pr(E^) 

(ic2\ 

is  lattice-isomorphic  to  <r>  ,  via  the 

mapping  JO. 

Remarks  5.1  It  follows  that  :  Card  (Pi*(E^))  = 

<  r  >((CardE)  }  =  Card  [(r  -1)(E^)].  In  the  finite  case, 

with  Card(E)  =  n,  then  Card  (Pr(E^))  -  r^n  \  It 
follows  also  that,  for  every  family  (S  i)  i  ^  i  of  mvrs  on 


E,  one  has  : 


V  Si 

iel 


max  Si  and 

i  e  i 


min  Si 
iei 


6.  Product  of  multivalued  relations. 


Recall  first  that  if  P  and  Q  are  two  common 
relations  on  a  set  E,  one  usually  defines  the  product 
relation  P  ®  Q  in  the  following  way  :  for  every  (x,  y) 
E  E^  :  (x,  y)  E  P  ®  Q  if  and  only  if  there  exists  z  E  E 
such  that  (x,  z)  E  P  and  (z,  y)  E  Q.  Hence  P  ®  Q  is  also 
a  common  relation.  The  product  of  common  relations  is 
shown  to  be  associative  and  to  have  some  other 
properties  (cf.  for  instance  [3]) 


Definition  6.1  :  Let  R  and  S  be  any  two  multivalued 
relations  on  E.  Define  R  ®  S  as  the  multivalued  relation 
whose  characteristic  function  R  ®  S  has  the  value  : 
K3.S  ( x  ,  y )  =  V  R(x.z)  AS(z,y)  (6.1) 

2  E  E 

for  every  (x,  y)  E  E2  ♦ 

To  understand  this  definition,  recall  (see  Theorem 

5)  that  the  map  R  R  is  a  one-to-one  correspondence 
and  also  a  lattice  isomorphism,  so  that  whenever  R  and  S 
are  given,  then  R  and  S,  which  are  also  given,  are  both 

maps  from  E^  onto  <r>.  Since  <x>  is  a  complete  lattice, 
it  follows  that  the  right  part  of  relation  (6. 1)  defines  also 
some  map  G  from  E^  onto  <n>.  By  a  repeated  use  of 
Theorem  5,  there  exists  exactly  one  mvr  T  such  that  G  = 
T.  We  denote  it  by  T  =  R  ®  S,  and  called  it  the  product 
of  the  multivalued  relations  R  and  S. 

Theorem  6  :  Let  R  and  S  be  any  two  quasi 
relations  on  E,  say  R=‘p‘  and  S  =  ‘  p  \  Then 
R  ®  S  is  a  quasi  relation  and 

Proof  :  Recall  that  ao  =  0  and  ar_i  =  1.  Since  R  =  4  p  ‘  = 
( p,  0,...,  0,  p)  and  S  =  ‘  p  ‘  =  (  p,  0,...,  0,  p), 
both  p(x,y)  and  p(x,  y)  belong  to  {0,1}.  From  relation 

(6.1),  it  then  follows  that  R  (£)  S  (  x  ,  y  )  £  {0,1}  for 
every  (x,  y)  in  E2.  Hence  the  mvr  R  0  S  is  a  quasi- 
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(6.8) 


relation.  Moreover,  from  relation  (6.1),  it  follows  that 
R  (*)  S  (  x  ,  y  )=  1  =  C  p  ‘  ®  4  \i  ‘)(x,  y)  if  and  only  if 
there  exists  z  in  E  such  that  R  (x,  z)  =  1  and  S  (z,  y)  =  1, 

that  is  (x,  z)  E  p  and  (z,  y)  E  p.  In  other  words 

(‘  p  *  ©  1  \i  ‘)(x,  y)  =  1  if  and  only  if  (x,  y)  belongs  to 

the  (common)  relation  p  ®  p.  Hence  (‘  p  1  ®  *  \i 

=  p  ©  p.  Since  (‘  p  ‘  ‘  p  *  )(x,  y)  E  {0,1},  it  follows 

that  (‘  p  1  ®  1  p  * )  < J >  =  0  for  1  ;£  j  <  r-2 ,  and  also 
(‘  p  1  ®  *  p  4  )(x,  y)  =  0  if  and  only  if  (x,  y)  E  p  ®  p 
=  (‘p<®>  ‘)<0>*  Finally  ‘p‘®‘|A‘*‘p®|i< 

♦  Therefore  the  product  of  mvrs  turns  out  to  be  a  proper 
extension  of  the  product  of  common  relations  as  recalled 
supra. 

Remark  6.1  :  Assume  E  to  be  a  finite  set  with  n 
elements,  say  E  =  {1,  2,..,  n}  and  R  to  be  a  mvr  on  E. 
Define  the  square  postian  matrix  A  of  size  n,  whose 
element  A  j  j  in  i-th  row  and  j-th  column  is  equal  to  R  (i, 
j)  E  <  r>.  With  our  notations  in  ([5]),  we  write  A  E 
Mn(<  r  >).  A  is  said  to  be  the  matrix  of  the  multivalued 
relation  R  (denoted  A  =  Mat  (R)).  It  is  easily  shown  that 
if  A  =  Mat  (R),  and  B=  Mat  (S),  then  A  O  B  =  Mat  (R  ® 
S),  where  O  denotes  the  postian  matrix  product  (see  [5]) 

♦  We  will  now  study  some  properties  of  the  product  of 
mvrs,  and  will  leave  some  of  the  proofs  to  the  reader. 

Theorem  7  (Associativity)  :  For  all  R,  S,  T  E 
Pr(E2)  (R  ®  S)  ®  T  =  R  ®  (S  ®  T)  (6.2) 

(the  proof  is  left  to  the  reader) 

Theorem  8  (Order compatibility).  Let  R,  S,  T  be  any 
three  multivalued  relations  and  R<  S.Then 

R  ®  T  <  S  ©  T  (63)  and  T  ©  R  <  T  ©  S  (6.4) 


[A  Si\®R<;  A  (Si  ®  R) 

\ iei  /  iei 

(the  proof  is  left  to  the  reader  :  recall  that  Pj-  (E2)  is 

complete,  so  that  A  Si  exists  for  every  (S  *)  [  e  i). 
iei 

Theorem  10  (Unit  element).  Let  I  denote  the 
quasi-relation  I  =  A  Then,  for  every 
multivalued  relation  R  : 

I  ©  R  =  R  ®  I  =  R  (6.9) 

Proof  :  For  every  (x,  y)  in  E2 
I  ®  R  (  x  ,  y )  =  ZVE 1  (  x  ,  z )  a  R  (  z  ,y ) 

On  the  other  hand  I  (x,  z)  E  {  0, 1}  and  I  (x,  z)  =  1  iff 
(x,  z)  E  A,  that  is,  x  =  z. 

Hence,  for  every  (x,  y)  in  E2  : 
I  ©  R  (  x  ,  y )  =  R  (  x  ,  y  )  andI®R  =  R. 

Hence  I  is  the  unit  element  of  (Pr  (E2),  ®)- 

Definition  6.2  :  A  mvr  R  is  said  to  be  reflexive  if 
I  <  R. 

Remarks  6.2  :  Since  I  =  <  A  R  is  reflexive 

iff  (x,  x)  E  R  <r" 1  >  for  every  x  E  E.  Consequently,  if 
R  is  a  quasi -relation,  say  R  =  ‘  p  then  R  is  reflexive  iff 
(x,  x)  E  p  for  every  x  E  E.  Hence 4  p  ‘  is  a  reflexive  mvr 
iff  p  is  a  common  reflexive  relation. 

7.  Transitivity  for  multivalued  relations. 

§  7.  1  Transitivity . 

We  first  state  a  lemma: 

Lemma  7.1  :  Let  f  be  the  characteristic  function  of  a 
multivalued  relation  R  (see  Theorem  5). Then,  for  every 
k=  1,  2,...,  r-1,  the  monotonic  component  f<  k  >  is  the 
characteristic  function  of  '  R  <  k  >  ’• 


Proof  :  R  ^  S  implies  R  ^  S,  so  that  R  (x,  y)  <  S  (x,  y) 
for  every  (x,  y)  in  E2.  It  follows  : 
R®T(x,y)=  V  R(x,z)  Al(z.y) 

z  E  E 

s  V  S(x,z)  a  T  (  z  ,  y  )  =  S®T(x,y),  so 

zEE 

that  (6.3)  holds.  An  analagous  proof  is  valid  for  (6.4). 

Theorem  9  (Distributivities).  Let  (S  j)  j  e  I  be  any 

family  of  multivalued  relations  on  E.  Then 
for  any  multivalued  relation  R,  one  has: 

R®  (  V  Si\=  V  (R®  Si)  (6-5) 

\iei  J  iei 

V  Si  )  ®  R  =  V  (Si®R)  (6.6) 

iei  /  iei 

R®  /  A  Si  A  (R®  s0  (6-7> 

V  i e i  /  iei 


Proof :  Recall  first  that  f  (x,  y)  €E  <  r  >,  so  that 
f<  k  >  (X.  y)  =  (f(x.  y))<k>e<2>.  In  fact, 

-.  if  f  (x,  y)  =  ao  =  0,  then  f<k>  (x,  y)  =  0  for  every  k. 

-.  if  f  (x,  y)  =  aj,  with  1  <  j  <  r  -2,  then 
f<k>  (x,  y)  =  1  for  1  <;  k  s  j,  and  f<k>  (x,  y)  =  0 
for  j  +  1  sksr-1. 

-.  if  f  (x,  y)  =  ar-i  =  1,  then  f<  k  >  (x,  y)  =  1  for  every  k. 
Hence  f<  k  >  (x,  y)  =  1  f  (x,  y)  =  aj  for  j  (x,  y) 

6R<i  >  for  j  a  k  (x,  y)  e  U  R<J>=R<k>. 

k  s  j  £T-  1 

Since  f<  k  >  (x,  y)  E  { 0,1 },  it  follows  that 

f<  k  >  (x,  y)  =  0  (x,  y)  E  R<  k  >.  It  follows  that  f<  k  > 

is  the  characteristic  function  of  the  mvr 
(  R<k>,  0,  ....  0,  R<k>),  that  is  '  R  <  k  >  ’  ♦  Finally, 
with  our  notations,  we  get :  if  f  =  R,  then  f<  k  >  = 

'(R)<k>'  ♦ 

Define  now  R1  =  R  and,  by  induction  :  Rn+1  =  Rn  ®R. 
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Definition  7.1  :  A  multivalued  relation  R  is  said  to 
be  transitive  if  R  2  <  R 

Theorem  11  :  Let  R  be  any  given  multivalued 
relation.  Then  R  is  transitive  if  and  only  if 
each  of  its  monotonic  components  R<k>  is 
(commonly)  transitive. 

Proof :  R  is  transitive  if  and  only  if  R  2  <  R,  that  is,for 
every  (x,  y)  in  E2  : 

zYeR(x,z)  a  R  (  z  ,  y  )  s  R  (  x  ,  y )  (7.1) 

From  remarks  (3.3.2),  it  follows  that  (7.1)  is 
equivalent  to  (7.2)  below: 

(VeR(x>z)  a  R  ( z  ,y )  j<k>  s  ( R(  x  ,  y )  )<k> 

for  every  k  =  1,  2,..,  r-1.  On  the  one  hand,  by  the 
definition  of  R  :  (R  (x,  y))  <  k  >  =  ((S)  <  k  >)  (x,  y). 

On  the  other  hand,  since  every  A  A<k>  is  a 
lattice  homomorphism  (see  remarks  3.3.2) : 

( z Ye  R  (  x .  z )  AR(z,y)|<k> 

=  zYE(E,(x’z))<k>  <k> 

=  2YE^RL><x-z))A((&)<k>(z>y>) 

Thus  the  relation  (7.2)  turns  out  to  be  equivalent 
to  ••  VE(  ( R  )<k>(  X ,  z) )  A  ( ( R)<k>(  z ,  y ) ) 

s  (R)<k>(x,y)  (73) 

for  every  k  and  every  (x,  y).  On  the  other  hand,  by  the 
preliminary  lemma  7.1  : 


Hence  (7.3)  can  be  rewritten : 

vE(  ,(Ru>,(*-z>H  '(R)<k>'(z-y)) 

s  ,(R)<k>'(x-y). 

It  follows  that  for  every  k,  *  (R)<k>*  is  a 
transitive  mvr,  so  that  (R)<k>  is  a  common  transitive 
relation.  This  is  a  necessary  and  sufficient  condition  for  R 
to  be  transitive. 

Corollary  11.1  :  A  quasi-relation  ‘  p  ’  is 
transitive  if  and  only  if  p  is  (commonly)  transitive. 

Proof :  Obvious,  since  R  <  ^  >  =  p  for  every  k  ♦  Hence, 
multivalued  transitivity  turns  out  to  be  a  proper 
extension  of  common  transtitivity. 

§  7.  2  Transitive  closure . 

Theorem  12  :  Let  (R  i)  i  e  I  be  any  family  of 
transitive  multivalued  relations  on  E.  Then  the 

infimumR=  A  Ri  is  transitive. 

iei 

Proof  ;  On  account  of  the  formula  (6.7),  and  the 
transitivity  of  R{,  one  has  :  R2  = 


A  Ri)2=  /  A  RiW  A  Ri  W  A  ( Ri ® Rj ) 

,eI  \»ei  j  \iei  j  ije,2 


= 

A 

Ri2l  A 

A  ( Ri  ®  Rj ) 

Lie  i 

i,  j  e  i 2 

-  i*j 

< 

[A 

R,ur 

A  (  Ri  ®  Rj  )1  = 

iei 


i.  jei2 
L  i*j 


=  R  A  Z  s  R, 


so  that  R  is  transitive. 

Corollary  12.1  :  For  every  mvr  S  there  exists  the 
least  transitive  mvr  6  dominating  S,  called  the  transitive 
closure  of  S. 

Proof  :  Let  V  be  the  set  of  all  multivalued  transitive 
relations  R  on  E  such  that  S  <  R  ;  V  is  obviously  not 
empty  since  it  contains  1  =  '  E2  \  From  the  previous 
theorem,  it  follows  that  the  infimum  6  of  V  exists  and  is 
a  transitive  relation.  Moreover,  6  G  V  and  S  <  6,  hence 
6  is  the  required  multirelation  which  dominates  S. 


8.  Multivalued  preorder  relations. 

Definition  8.1  :  A  multivalued  relation  which  is 
reflexive  and  transitive  is  called  a  multivalued  preorder 
(shortly  an  mv-preorder) 


Theorem  13  :  Let  R  be  a  multivalued  preorder. 
Then  R2  =  R. 

Proof  :  R  is  transitive  so  that  R  ®  R  <  R.  And  R  is 
reflexive,  so  that  I  <  R.  It  follows  R  ®  I  <  R  ®  R  (see 
Theorem  8).  Since  R  ®  I  =  R  (see  Theorem  10),  it 
follows  R  ®  R  =  R  ♦ 


Theorem  14  :  A  multivalued  relation  R  is  a 
multivalued  preorder  if  and  only  if  every 
R  <  j  >  is  a  (common)  preorder. 

Proof :  R  is  an  mv-preorder  if  and  only  if  R  is  reflexive 
and  transitive.  On  the  one  hand,  R  is  transitive  if  and 
only  if  every  R  <  j  >  is  (commonly)  transitive  (Theorem 
1 1).  On  the  other  hand,  R  is  reflexive  if  and  only  if  every 
R  <  j  >  is  (commonly)  reflexive  (see  Remarks  6.2).  So 
that  R  is  an  mv-preorder  iff.  every  R  <  j  >  is  a  (common) 
preorder  ♦ 

It  follows  that  a  quasi -relation  ‘  p  ‘  is  a 
multivalued  preorder  if  and  only  if  p  is  a  (common) 
preorder  relation  (proof  is  obvious  since  R  <  ^  >  =  p  for 
every  k).  Hence,  multivalued  preorders  turn  out  to  be 
proper  extensions  of  common  ones. 
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9.  Classes  of  antisymmetry. 

Definition  9.1  :  An  r-valued  relation  R  is  said  to  be 
class  k  -antisymmetric  (lsksr  -  1)  if  its  monotonic 
component  R  <  k  >  is  (commonly)  antisymmetric.  A 
mvr  R  is  said  to  be  antisymmetric  if  there  exists  k  (Is 
k  <  r  -  1)  such  that  R  is  class  k  -antisymmetric. 

Remarks  9.1  :  Recall  R<k>  is  (commonly) 

antisymmetric  if  and  only  if 

(V  (x,  y)  £  E  2)  (x,  y)  e  R  <k>and  (y,  x)  G  R  <k>  => 
x  =  y.  On  the  other  hand,  in  order  that  (x,  y)  belongs  to 
R  <k>  =  U  R<j>,  it  is  necessary  and  sufficient 

k  s  j  s  r-1 

that  R  (  x  ,y )  £  at,  that  is,  the  degree  of  comparability 
of  x  to  y  is  greater  or  equal  than  k.  It  follows  that  a  mvr 
R  is  class  k-antisymmetric  if  and  only  if,  whenever  both 
comparabilities,  of  x  to  y  and  y  to  x,  are  greater  or  equal 
than  k,  then  x  =  y.  Examples  :  a)  R  is  class  1- 
antisymmetric  if  and  only  if  R(x,y)*0  and 
R  (  y  ,  x  )  ^  0  imply  x  =  y :  in  other  words,  there  are  not 
distinct  elements  such  that  their  mutual  degrees  of 
comparability  are  both  non  zero,  b)  R  is  class  (r  -  1)- 
antisymmetric  if  and  only  if  R  (  x  ,y )  =  1  and 
R  (  y  ,x  )  =  1  imply  x  =  y.  In  other  words,  there  are  not 
distinct  elements  such  that  their  mutual  degrees  of 
comparability  are  both  maximum.  As  we  can  infer  from 
these  two  examples,  antisymmetry  may  bring  some 
limits  to  the  possibility  of  differentiating  elements. 

Theorem  15  :  A  multivalued  relation  R  is 
antisymmetric  if  and  only  if  it  is  class  (r-1)- 
antisymmetric.  A  quasi-relation  ‘  p  ‘  is 
antisymmetric  if  and  only  if  p  is  (commonly) 
antisymmetric. 

Proof  :  If  R  is  class  (r-l)-antisymmetricr  then  it  is 
antisymmetric.  Conversely,  assume  R  to  be  class  k  - 
antisymmetric.  Recall  this  means  :  (V  (x,  y)EE^) 

[(x,  y)  E  R  <k>  and  (y,  x)  E  R  <k >  =>  x  =  y  ]. 

Since  R  <  i  R  •<  2  >-^  ...^  R  <  r-1  >,  ^ 
follows  that,  for  every  p  £  k,  (x,  y)  E  R  <  p  >  and  (y,  x) 
ER  <p>=>  (x,  y)  E  R  < k >  and  (y,  x)  E  R  <k>so  that 
x  =  y.  It  follows  that,  if  R  is  class  k-  antisymmetric, 
then  for  every  p  £  k,  R  is  class  p-antisymmetric.  We  will 
say  that  antisymmetry  is  “hereditary”  property.  Hence 
every  antisymmetric  relation  is  class  (r-1)- 
antisymmetric.  If  R  is  the  quasi -relation  ‘  p  \  then 
R  <  k  >  =  p  for  every  k,  so  that  R  is  antisymmetric  if  and 
only  if  p  is  (commonly)  antisymmetric*  Hence, 
multivalued  antisymmetry  turns  out  to  be  a  proper 
extension  of  common  one. 

10.  Multivalued  order  relations. 

Definition  10.1  :  A  multivalued  relation  which  is 
reflexive,  transitive  and  class  k-  antisymmetric  (1  ^  k  ^  r 
-  1)  is  called  a  class  k-multivalued  order.  A  multivalued 


relation  R  is  said  to  be  a  multivalued  order  if  there  exists 
k(l^k<r  -  1)  such  that  R  is  a  class  k-multivalued 
order.  In  such  cases,  for  every  (x,  y)  in  E2,  the  degree  of 
comparability  of  x  to  y  will  be  called  the  degree  of  order 
of  x  to  y  ( w.r.t  R). 

Theorem  16  :  A  multivalued  relation  R  is  a 
class  k-multivalued  order  if  and  only  if  R  is  a 
multivalued  preorder  and  if  R<k>  a 

(common)  order. 

Proof  :  From  Definition  10.1,  it  follows  that  R  is  an 
mv-  order  of  the  class  k,  if  and  only  if  two  conditions  are 
fulfilled  :  a)  R<k>  is  (commonly)  antisymmetric  ;  b)  R  is 
an  mv-  preorder.  In  such  case,  every  R<j>  is  a  common 
preorder  (Theorem  14),  so  that  R<k>  is  a  (common)  order 
♦  The  converse  is  obviously  true. 

Theorem  17  ;  A  multivalued  relation  R  is  a 
multivalued  order  if  and  only  if  it  is  a  class  (r- 
l)-multivalued  order.  A  quasi-relation  ‘p‘  is  a 
multivalued  order  if  and  only  if  p  is  a 
(common)  order. 

Proof  :  If  R  is  a  class  (r-l)-multi valued  order  relation, 
then  R  is  a  (multivalued)  order.  Conversely,  if  R  is  a 
multivalued  order,  then  R  is  a  mv-preorder.  Moreover,  R 
must  be  class  k-antisymmetric,  for  some  integer  k  (1  <  k 
<  r-1).  From  Theorem  15,  it  then  follows  that  R  is  class 
(r-l)-antisymmetric,  so  that  R  is  a  class  (r-l)-multi valued 
order  *If  R  is  the  quasi -relation  ‘p  \  then  all  its 
monotonic  components  are  equal :  R  <  j  >  =  p  for  every  j. 
Hence  R  =  ‘p  ‘  is  a  mv-order  if  and  only  if  p  is  a 
common  order  ♦  Hence,  multivalued  orders  turn  out  to  be 
proper  extensions  of  common  ones*  Hence  a  class  k- 
multivalued  order  R  is  completely  determinated  by  a 
decreasing  sequence  of  (r-1)  common  preorders  on  E: 
R<1>3  R<2>^.. R  <r-l  >  such  that  for  p  £  k, 
all  the  R  <  p  >  ‘s  are  (common)  orders.  Note  that,  from 
the  fact  that  all  monotonic  components  R  <  p  >  (p  ^  k) 
are  orders  in  the  common  sense,  we  cannot  derive  that  the 
disjunctive  components  R  <  P  >  are  (common)  orders, 
except  for  the  case  k  =  r-1,  since  R  <  r-i  >  =  R  <  r'*  >- 

11.  Classes  of  symmetry. 

§  11.1.-  Recall  that  a  common  relation  p  on  E  is 
said  to  be  symmetric  if  (V  (x,  y)  E  E  2) 

(x,  y)  E  p  =»  (y,  x)  E  p  (11.1) 

§11.2.  Definition  11.2  :  An  r-valued  relation 
R  is  said  to  be  class  k  -symmetric  (1  £  k  £  r  -  1)  if  its 
monotonic  component  R<  k  >  is  (commonly)  symmetric. 
A  multivalued  relation  R  is  said  to  be  symmetric  if  there 
exists  k(l<k^r-l)  such  that  R  is  class  k  -symmetric. 
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Remarks  11.2  :  R  <  fc  >  is  (commonly)  symmetric  if 
and  only  if  (V  (x,  y)  E  E2)  (x,  y)  E  R  <  k  > 

(y,  x)  E  R  <  k  >.  On  the  other  hand,  in  order  that  (x,  y) 

belongs  to  R  <  ^  >  =  U  R  <  J  >,  it  is  necessary  and 

k  sj  sr-1 

sufficient  that  R  (  x  ,y  )  ak,  that  is,  the  degree  of 
comparability  of  x  to  y  is  greater  or  equal  than  k.  It 
follows  that  a  mvr  R  is  class  k-symmetric  if  and  only  if, 
for  every  (x,  y)E  E  2  R  (  x  ,y  )  £  at  holds  if  and  only  if 
R  (  y  ,  x  )  >  ak  holds.  In  other  words  a  mvr  R  is  class  k- 
symmetric  if  and  only  if,  for  every  (x,  y)  E  E2,  the  two 
mutual  degrees  of  comparability,  of  x  to  y  and  y  to  x,  are 
both  greater  or  equal  than  afc,  or  alternatively,  both  lesser 
than  afc-l  (in  the  latter  case,  we  must  suppose  k  £  2). 
Examples  :  a)  R  is  class  1-symmetric  if  and  only  if 
R(x,y)*ao  =  0  holds  if  and  only  if  R  (  y  ,  x )  *  0 
holds,  b)  R  is  class  (r  -  l)-symmetric  if  and  only  if 
R  (  x  ,y  )  =  a  r-i  =  1  holds  if  and  only  if  R  (  y  ,x  )  =  1 
holds  ♦  Moreover  it  should  be  emphasized  that,  while 
antisymmetry  is  hereditary  (see  Theorem  15),  symmetry 
is  not.  More  precisely,  assume  R  <  k  >  to  be  symmetric 
and  p  £  k.  Then,  (x,  y)  E  R  <p  >  =>  (y,  x)  E  R  <  k  >,  so 
that  (y,x)ER<k  >.  From  this,  we  cannot  derive  (y ,  x) 
ER<p>}  which  would  have  been  necessary  to  prove  the 
symmetry  of  R  <p>. 

Theorem  18  A  quasi-relation  *  p  ‘  is 
symmetric  if  and  only  if  p  is  (commonly) 
symmetric. 

The  proof  is  obvious4-Hence,  multivalued 
symmetry  turns  out  to  be  a  proper  extension  of  common 
symmetry. 

§  11.3.-  Definition  11.3  :  An  r-valued  relation 
R  is  said  to  be  class  k- strongly  symmetric  (1  <  k  <  r  -  1) 
if,  for  every  p  ^  k,  all  its  monotonic  components  R  <p  > 
are  (commonly)  symmetric.  A  multivalued  relation  R  is 
said  to  be  strongly  symmetric  if  there  exists  k  (1  <  k  < 
r  -  1)  such  that  R  is  class  k-strongly  symmetric. 

Clearly  every  strong  symmetric  relation  (resp. 
class  k-strong  symmetric)  is  symmetric  (resp.  class  k- 
strongly  symmetric),  but  all  converses  are  false. 

Theorem  19  :  A  multivalued  relation  R  is 
strongly  symmetric  if  and  only  if  it  is  class 
(r-l)-strongly  symmetric.  A  quasi -relation  ‘  p  ‘ 
is  strongly  symmetric  if  and  only  if  p  is 
(commonly)  symmetric. 

Proof :  If  R  is  class  (r-l)-strongly  symmetric,  then  it  is 
strongly  symmetric.  Conversely,  assume  R  to  be  class  k- 
strongly  symmetric,  then  for  every  p  ^  k,  all  its 
monotonic  components  R<p>  are  (commonly) 
symmetric.  Hence  R<r_i>  is  symmetric,  so  that  R  is 
class  (r-l)-strongly  symmetric  ♦  If  R  is  the  quasi-relation 
p  \  then  R<k>  =  p  for  every  k.  Then  one  has  the 
following  logical  equivalences  :  R  is  mv-symmetric  p 
is  commonly  symmetric  R  is  mv-strongly 
symmetric. 


12.  Multivalued  equivalence  relations. 

Definition  12.1  :  A  multivalued  relation  R  which  is 
reflexive,  transitive,  and  symmetric  (resp.  class  k- 
symmetric),  is  called  a  multivalued  equivalence  relation 
(resp.  a  class  k-multivalued  equivalence  relation ).  In  such 
cases,  for  every  (x,  y)  in  E2,  the  degree  of  comparability 
of  x  to  y  will  be  called  the  degree  of  equivalence  of  x  to 
y  ( w.r.t  R). 

Definition  12.2  :  A  multivalued  relation  which  is 
reflexive,  transitive,  and  strongly  symmetric  (resp.  class 
k-strongfy  symmetric),  is  called  a  multivalued  strong 
equivalence  relation,  (resp.  a  class  k-multivalued  strong 
equivalence  relation ). 

Remarks  12.2  :  Recall  that  a  mvr  R  is  said  to  be 
strongly  symmetric  if  there  exists  k  such  that  for  every  p 
£  k,  every  R  <  p  >  is  a  (common)  symmetric  relation, 

that  is  to  say  :  (V  (x,  y)  E  E2)  (x,  y)  E  R  <  p>  =>  (y,  x) 
ER  <  p  >  ;  it  follows  that  every  multivalued  strong 
equivalence  relation  is  a  multivalued  equivalence  relation. 

Theorem  20  :  A  multivalued  relation  R  is  a 
multivalued  equivalence  relation  if  and  only  if 
every  R<  j>  is  a  (common)  preorder  and  if  there 
exists  k  (1  <  k  <  r  -1)  such  that  R  <  >  is  a 

(common)  order.  A  multivalued  relation  R  is  a 
multivalued  strong  equivalence  if  and  only  if 
every  R<j>  is  a  (common)  preorder  and  if  there 
exists  k  (1  <  k  <  r  -1)  such  that  for  every  p  ^  k, 
R<p>is  a  (common)  equivalence  relation. 

The  proof  is  pure  routine  ♦  Hence  a  multivalued 
strong  equivalence  R  is  completely  determinated  by  a 
decreasing  sequence  of  (r-1)  common  preorders  relations 
on  E  :  R  <  j  >  D  R<2  ••  R  <  r-l  >  such  that 

there  exists  k  such  that,  for  p  £  k,  all  R  <  p  >  ’s  are 
common  equivalences. 
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Abstract 

In  this  paper  we  compare  effects  of  two  different  domain 
groups  for  switching  functions  to  the  efficiency  of  calcula¬ 
tion  of  spectral  transforms  (ST)  representations  and  the 
complexity  of  Decision  diagrams  (DDs)  representations. 
Dyadic  groups  and  quaternion  groups  are  assumed  for  do¬ 
main  groups  for  switching  functions.  We  compared  space 
and  time  complexity  in  calculation  of  STs  representations 
through  FFT.  DDs  are  compared  in  terms  of  their  basic 
characteristics ,  the  depth ,  the  width ,  and  the  size.  The 
area  of  a  DD,  defined  as  the  product  of  size  and  width, 
and  for  word-level  DDs,  the  ratio  between  the  number  of 
non-terminal  and  constant  nodes  are  discussed  as  another 
characteristics  of  DDs  dependent  on  the  domain  groups. 
It  is  shown  that  the  quaternion  groups  have  advantages  in 
processing  of  large  switching  functions.  When  number  of 
variables  grows,  these  advantages  increases. 

1.  Introduction 

An  algebraic  structure  is  suitable  for  logic  design  if  it 

1.  Provides  compact  representations  of  switching  func¬ 
tions, 

2.  Permits  computation  of  the  representations  for  a  given 
function  /  efficiently  in  terms  of  space  and  time. 

The  first  requirement  should  ensure  realizations  with  lo¬ 
gic  networks  occupying  small  area  and  consisting  of  a  small 
number  of  logic  elements.  Efficiency  in  computation  of  a 
ST- representation  is  usually  a  basis  for  efficient  algorithms 
for  manipulation  and  calculations  with  represented  func¬ 
tions.  The  same  applies  to  DDs  that  are  graphical  rep¬ 
resentations  of  ST-representations  [16],  [17].  Thus,  the 
second  requirement  strongly  interferes  with  applications,  as 
for  example,  testability,  verification,  and  similar,  performed 
through  these  representations. 

1.1  Boolean  structures 

Classically,  logic  design  is  performed  in  the  Boolean  algebra 
over  the  set  B  =  {0, 1}  in  terms  of  the  logic  AND  (V)  and 
OR  (A).  The  Boolean  ring  over  B  in  terms  of  AND  and 
modulo  2  addition,  EXOR  (0),  is  a  closely  related  algebraic 
structure. 

From  1990,  convenience  of  the  Boolean  ring  in  the  above 
sense  is  greatly  confirmed  both  theoretically  and  experi¬ 
mentally  [8],  [9],  [10].  Thus,  the  importance  of  AND- EXOR 
synthesis  considerably  raises  [8],  [9].  It  is  further  supported 


by  AND-EXOR  related  decision  diagrams  [9],  which  extend 
its  applicability  to  switching  functions  with  a  large  number 
of  variables. 

1.2  Vector  spaces 

Spectral  transform  approach  to  calculation  of  polynomial 
representations  in  terms  of  AND  and  EXOR  proved  very 
efficient  [1],  [2],  [3].  In  this  approach,  the  switching  func¬ 
tions  of  a  given  number  of  variables  are  considered  as  ele¬ 
ments  of  linear  vector  spaces  over  the  Galois  field  GF( 2). 
The  spectral  techniques  based  on  other  spectral  transforms 
used  in  switching  theory  and  logic  design,  consider  switch¬ 
ing  functions  as  elements  of  linear  vector  spaces  over  the 
field  of  rational  numbers  Q  or  the  complex-field  C.  It  is 
assumed  that  the  logic  values  0  and  1  are  formally  replaced 
by  the  integers  0  and  1,  or  in  some  cases  suitably  coded  as 
(0,1) ->(1,-1)  [6]. 

1.3  Domain  groups 

The  group-theoretic  approach  to  switching  theory  permits 
to  determine  a  relationship  between  the  Boolean  ring  and 
vector  spaces  used  in  switching  theory.  In  this  approach, 
the  Boolean  ring  and  vector  spaces  of  switching  functions 
over  GF( 2)  or  C  can  be  uniformly  considered  as  algebraic 
structures  on  finite  dyadic  groups  CJ  =  C2  x  •  •  •  x  C2,  where 
x  denotes  the  direct  product,  and  C2  =  (B,0).  This  rela¬ 
tionship  extends  to  the  Boolean  algebra  through  the  relation 
between  the  Boolean  ring  and  the  Boolean  algebra.  There¬ 
fore,  the  finite  dyadic  group  is  a  basic  algebraic  structure 
usually  used  for  switching  theory  and  logic  design. 

Definition  1:  An  n- variable  switching  function  is  defined 
as  the  mapping  /  :  C%  ->  B. 

Thus,  being  dependent  on  two-valued  variables,  the 
switching  functions  are  naturally  considered  as  functions 
on  the  dyadic  groups.  At  the  same  time,  being  two- valued 
functions,  the  set  of  all  switching  functions  of  n- variables 
is  the  dyadic  group  C%n  under  the  componentwise  EXOR 
performed  over  truth- vectors. 

These  basic  features  determine  complexity  of  representa¬ 
tions,  manipulations,  and  calculations  with  switching  func¬ 
tions.  If  n  is  large,  these  features  become  a  restrictive  factor 
for  many  methods  and  techniques.  For  example,  determin¬ 
ation  of  coefficients  in  ST-representations,  as  for  example, 
the  Reed-Muller  expressions  and  related  Kronecker  expres¬ 
sions  [9],  by  FFT-like  algorithms  [1],  [2],  expresses  expo¬ 
nential  complexity  of  0(2n)  and  0(n2n)  in  terms  of  space 
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and  time.  The  same  applies  to  arithmetic,  Walsh,  and  other 
related  representations  of  switching  functions  [18].  Thus,  if 
n  is  large,  for  example  n  >  30,  such  algorithms  are  hardly 
applicable  on  most  of  standard  computer  architectures. 

Decision  diagrams  on  CJ  are  subjected  to  the  same  re¬ 
strictions.  Irrespective  of  the  optimization  by  choosing  dif¬ 
ferent  decomposition  rules,  a  DD  on  C%  has  n  levels  con¬ 
sisting  of  nodes  with  two  outgoing  edges.  Complexity  of 
DDs  on  C% ,  as  for  example  Binary  DDs  (BDDs),  approx¬ 
imates  on  the  average  0{2n  jn)  [9].  Thus,  when  n  is  large, 
we  cannot  build  BDD  for  /. 

In  [19],  the  following  question  is  asked:  ’’The  ultimate 
purpose  must  be  to  find  out  whether  this  group  (the  qua¬ 
ternion  Q2)  may  be  as  significant  for  logic  synthesis  as  it 
seems  to  be  for  filtering  and  other  signal  processing  tasks” . 

Considerations  in  this  paper  are  an  attempt  towards  ap¬ 
proaching  to  an  answer  to  this  question.  Recently,  few  pa¬ 
pers  discussed  different  topics  in  Fourier  analysis  on  non- 
Abelian  groups  and  application  in  switching  theory  [13], 
[14],  [15],  [19].  In  this  paper,  we  present  a  short  compila¬ 
tion  of  some  selected  results  concerning  quaternion  groups. 
Then,  we  give  a  critical  analysis  and  comparison  of  them, 
which  shows  suitability  of  quaternion  groups  for  some  tasks 
in  logic  design.  Since  there  are  no  proper  analogies  of  the 
Boolean  algebra  or  Boolean  ring  based  on  the  quaternion 
groups,  the  comparison  is  restricted  to  vector  space  struc¬ 
tures  based  on  the  dyadic  groups  and  the  quaternion  groups 
over  C. 

1.4  Comparisons 

We  compare  efficiency  of  dyadic  groups  and  the  quaternion 
groups  in 

1.  Complexity  of  calculation  of  coefficients  in  ST- 
representations  through  FFT. 

2.  Complexity  of  various  DDs  on  dyadic  groups  and 
Fourier  DDs  on  the  quaternion  groups. 

We  assume  that  in  an  n- variable  switching  function  /,  each 
triplet  of  variables  x,-,  £j ,  x*,  where  each  variable  takes  val¬ 
ues  in  C2,  is  replaced  by  a  variable  Xr  in  Q2*  In  that  way, 
the  domain  group  C%  for  /  is  replaced  by  Q f>,  for  n  =  3Ar, 
by  C2Q2  ^or  71  =  3*  +  1,  and  by  C^Q\  for  n  =  3A:  -f  2.  C4 
is  the  cyclic  group  of  order  4. 

2.  Fourier  Transform 

In  this  section,  we  briefly  review  basic  definitions  of  Fourier 
transform  on  non- Abelian  groups.  For  more  information, 
we  refer  to  classical  references,  as  [5],  or  the  more  recent 
publications  as  [18]. 

Denote  by  C(G)  the  space  of  complex  functions  on  a  fi¬ 
nite  not  necessarily  Abelian  group  G  of  order  g .  Denote 
by  T  the  dual  object  of  G.  T  consists  of  K  unitary  irredu¬ 
cible  representations  Rtu(x)  of  G  over  C.  Note  that  R^x) 
stands  for  a  non-singular  ( rw  x  r^)  matrix  over  C.  For  each 
non- Abelian  group,  at  least  one  of  the  representations  is  of 
the  order  rw  >  1. 

Definition  2:  The  direct  and  inverse  Fourier  transforms 


of  a  function  /  6  C(G)  are  defined  respectively  by, 

9- 1 

S/M  =  rwg-1J2f(u)Rw(u~1),  (1) 

u=0 

K-l 

/(*)  =  E  IV(S/(ti»)R.(*)),  (2) 

tU=0 

where  for  a  matrix  Q,  Tr( Q)  denotes  the  trace  of  Q,  i.e., 
the  sum  of  elements  on  the  main  diagonal  of  Q. 

From  thiis  definition,  the  Fourier  coefficients  (1)  for  /  on  a 
non- Abelian  group  are  (rw  xrw)  matrices  when  rw  >  1.  The 
same  definition  applies  to  Abelian  groups.  In  that  case,  all 
the  unitary  irreducible  representations  are  one-dimensional, 
i.e.,  they  reduce  to  the  group  characters. 

FFT  algorithms  on  non- Abelian  groups  are  introduced 
in  [7] .  Algorithms  for  calculations  in  this  paper  are  based 
on  their  matrix  interpretation  in  [12],  [13]. 

3.  Complexity  of  FFTs 

In  this  section  we  compare  the  space  and  time  com¬ 
plexity  of  FFT  on  dyadic  and  quaternion  groups  by  the 
way  of  examples  of  mcnc  benchmark  functions  used  in 
logic  design  and  random  generated  switching  functions. 
Multiple-output  functions  /o*/i  *  •  *  are  represented 

by  the  integer  equivalents  fz  determined  by  the  mapping 
fz  —  o  2*/i  [6]-  For  hardware  limitations,  the  com¬ 
parison  is  restricted  to  small  benchmark  functions.  For 
detailed  theoretical  considerations  and  further  examples  we 
refer  to  [15]. 

Walsh  transform  is  the  Fourier  transform  on  dyadic 
groups.  Calculation  of  the  Walsh  transform  of  switching 
function  by  FFT  requires  more  operations  than  calculation 
of  other  ST-representations  on  dyadic  groups,  as  for  ex¬ 
ample,  the  Reed-Muller,  Kronecker,  or  arithmetic  expres¬ 
sions  [11].  Thus,  it  is  enough  to  compare  complexity  of  cal¬ 
culations  of  Fourier  transforms  on  dyadic  and  quaternion 
groups.  The  derived  conclusions  hold  even  stronger  for 
other  ST-representations  on  dyadic  groups.  It  is  enough 
to  compare  implementations  through  FFT.  In  DDs  based 
calculations,  we  actually  implement  FFT.  For  a  given  /, 
efficiency  is  achieved  thanks  to  peculiar  properties  of  /, 
permitting  reduction  in  DDs.  Therefore,  for  a  fair  compar¬ 
ison  for  arbitrary  functions,  we  should  refer  to  calculations 
through  DTs.  However,  in  that  case,  the  number  of  opera¬ 
tions  is  equal  to  that  in  FFT.  Thus,  it  is  again  enough  to 
compare  just  FFTs. 

Fig.  1  compares  the  number  of  operations  in  FFT  on 
dyadic  groups  of  order  23r  and  quaternion  groups  of  order 
r.  It  shows  that  for  n  >  10,  the  quaternion  groups  require 
fewer  number  of  operations.  Note  that  the  Walsh  transform 
matrix  requires  just  additions  and  subtractions,  while  the 
Fourier  transform  matrix  on  Q2  involves  complex  unit  i  = 
v/-T.  However,  in  this  case,  advantages  are  taken  from 
the  appearance  of  zero  elements  in  the  transform  matrix. 
For  analytical  expressions  of  the  number  of  operations  and 
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details  of  implementation  of  FFT  on  the  compared  groups, 
see  [15].  Further  details  on  the  subject  are  available  by  the 
first  author. 


TABLE  II 

Comparisons  of  domain  groups. 


Table  I  shows  time  (t)  and  space  (m)  required  to  per¬ 
form  FFT  on  dyadic  (d)  and  quaternion  groups  (q)  for  some 
switching  functions  of  different  number  of  variables  n.  For 
n  =  8,9,10,14,  we  use  the  domain  groups  Cf,  £7f,  C20, 
C\4,  and  CAQl,  Ql,  C2Q2,  CAQ\,  respectively.  Compar¬ 
ison  of  the  considered  groups  is  given  in  Table  II  in  terms 
of  the  ratio  rt  =  td/tq  and  rm  =  md/mq  of  the  used  time 
and  space.  Table  III  and  Table  IV  give  the  same  informa¬ 
tion  for  random  generated  functions  denoted  by  fun(n)  for 
n  =  9, 10, 12, 14.  Time  unit  is  1msec  =  10-3sec.  The 
space  is  given  in  KBytes.  Calculations  are  performed  on  a 
133MHz  Pentium  PC  with  32MBytes  of  main  memory. 

TABLE  I 
Complexity  of  FFT. 


Fig.  2  compares  time  required  to  perform  FFT  on  C$r 
and  Q1  for  different  values  of  r.  Fig.  3  compares  the  cor¬ 
responding  space  requirements. 

This  consideration  shows  that  quaternion  groups  permit 
faster  implementation  of  FFT  for  large  switching  functions 


TABLE  III 

FFT  for  random  functions. 


TABLE  IV 

Comparison  of  FFT  for  random  functions. 


/ 

n 

rm 

fun(9) 

0.66 

2.30 

fun  (10) 

0.45 

2.35 

fun  (12) 

0.18 

2.79 

fun  (14) 

0.10 

2.80 
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n 


Fig.  2.  Time  requirements. 
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n 

Fig.  3.  Memory  requirements. 

than  dyadic  groups,  at  the  price  of  a  comparatively  small 
increase  of  the  required  space.  For  example,  for  n  >  10 
for  at  about  ten  times  faster  implementation  of  FFT,  three 
times  more  space  is  required.  When  the  number  of  variables 
n  grows,  the  advantages  of  quaternion  groups  increase. 

4.  Decision  Diagrams 

Decision  Diagrams  (DD)  are  convenient  data  structure  for 
representation  of  discrete  functions  [11].  A  DD  is  a  rooted 
acyclic  graph  consisting  of  the  root  node,  a  set  of  non¬ 
terminal  nodes,  and  a  set  of  constant  nodes  connected  with 
edges.  For  a  given  function  /,  a  DD  is  designed  by  the 
reduction  of  the  corresponding  Decision  Tree  (DT)  repres¬ 
enting  /  [11]. 

Complexity  of  a  DD  is  usually  characterized  by  its 

1.  size  -  the  total  number  of  nodes  (non- terminal  and 
constant  nodes), 

2.  width  -  the  maximal  number  of  non-terminal  nodes 
per  a  level,  where  a  level  in  a  DD  consists  of  nodes  to 
which  the  same  variable  in  /  is  assigned. 

3.  depth  -  the  number  of  levels  in  the  DD. 

We  suggest  to  consider  the  area  (a)  of  a  DD  defined  as 
the  product  of  width  (w),  and  size  (s),  thus  a  =  ws .  In  DDs 
bases  logic  design,  the  area  should  be  a  rough  measure  of 
space  occupied  by  the  networks  produced  through  DDs. 

In  word-level  DDs,  it  is  useful  to  consider  the  ratio  rn  of 
non-terminal  (ntn)  and  constant  nodes  (cn),  rn  =  ntn/cn. 
In  DDs  based  realizations,  each  non-terminal  node  requires 
a  circuit.  In  DDs  based  calculations,  a  subprocedure  is 
assigned  to  each  non- terminal  node.  The  constant  nodes 
are  considered  as  inputs  into  circuits,  respectively,  subpro¬ 
cedures.  Thus,  in  some  applications,  it  may  be  useful  to 
reduce  the  number  of  non-terminal  nodes  at  the  price  of 
increased  number  of  constant  nodes.  Clearly,  in  bit-level 
DDs,  rn  =  s/2. 

4.1  Fourier  decision  trees 

Various  DDs  are  defined  by  using  different  decomposition 
rules  [11].  Fourier  DDs  are  defined  with  respect  to  the 
Fourier  decomposition  defined  by  (2).  From  FFT  theory, 
the  Fourier  transform  on  a  finite  group  G  decomposable 


into  the  product  of  n  subgroups  Gi  may  be  considered  as 
the  n-dimensional  Fourier  transform  on  the  constituent  sub¬ 
groups  Gi  [14]. 

Definition  3:  Fourier  decision  tree  on  G  is  defined  as  the 
decision  tree  whose  nodes  at  the  i-th  level  represent  func¬ 
tions 

/(*,)  =  £  ZV(S/(tn)R., (*«)).  (3) 

WitTi 

where  I\  is  the  dual  object  of  Gi,  and  R«,.  are  the  unitary 
irreducible  representations  of  G{. 

In  Fourier  DT  for  a  given  /,  the  values  of  constant 
nodes  are  the  Fourier  coefficients  of  /.  The  Fourier  coeffi¬ 
cients  corresponding  to  the  group  representations  with  or¬ 
ders  rw  >  1,  w  =  1, . . K  —  1,  are  (rw  x  r^,)  matrices. 
Therefore,  Fourier  DTs  on  finite  non- Abelian  groups  (FN- 
ADDs)  are  matrix- valued  DTs,  since  some  of  their  con¬ 
stant  nodes  are  (r^  x  rw)  matrices.  The  matrix- valued 
coefficients  may  be  also  represented  by  the  DTs  on  Abelian 
groups  by  using  the  method  of  representation  of  matrices  by 
decision  diagrams  [3].  In  that  way,  we  derive  the  number¬ 
valued  FNADDs.  In  this  paper,  we  consider  number-valued 
FNADDs,  since  we  do  comparison  with  DDs  on  Abelian 
groups  that  are  the  number- valued  DDs. 

4.2  Fourier  decision  diagrams 

Fourier  decision  diagrams  are  derived  by  the  reduction  of 
Fourier  decision  trees.  The  reduction  is  performed  by  us¬ 
ing  generalized  BDD  reduction  rules  introduced  for  the  re¬ 
duction  of  the  Walsh  transform  DDs  (WDDs),  which  are 
Fourier  DDs  on  finite  dyadic  groups  [17]. 

Definition  4 *  Fourier  decision  diagrams  are  DDs  derived 
from  the  Fourier  decision  trees  by  using  the  generalized 
BDD  reduction  rules.  A  Fourier  decision  diagram  is  re¬ 
duced  if  further  reduction  with  the  same  rules  is  impossible. 

5.  Complexity  of  FNADDs 

In  this  section,  we  compare  various  DDs  on  dyadic  groups 
with  FNADDs  on  the  quaternion  groups.  Multiple-output 
functions  are  represented  by  Shared  BDDs  (SBDDs), 
Multi- terminal  binary  DDs  (MTBDDs)  [11],  and  FNADDs. 
For  representation  by  MTBDDs  and  FNADDs,  they  are 
first  represented  by  the  integer- valued  functions  fz .  De¬ 
pending  on  the  value  of  n,  in  FNADDs  on  quaternion 
groups  C2Q2  and  C4Q2,  we  use  as  the  root  node  the  Shan¬ 
non  nodes  with  two  and  four  outgoing  edges. 

Table  V  compares  sizes  and  widths  of  SBDDs  and  FN¬ 
ADDs  for  various  benchmark  functions.  For  FNADDs  the 
values  of  non- terminal  nodes  (ntn)  and  constant  nodes  (cn) 
are  shown  separately.  Thus,  the  size  of  FNADDs  is  the  sum 
of  these  two  values.  This  table  shows  that,  besides  depth 
reduction,  we  get  the  width  reduction,  except  for  5xpl.  Ex¬ 
cept  this  function  and  coni,  the  area  is  also  reduced.  In 
FNADDs,  rn  is  quite  more  convenient.  For  example,  in 
xor5,  where  the  size  and  width  remain  the  same,  it  is  5.5 
and  0.83  for  SBDDs  and  FNADDs,  respectively. 
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TABLE  V 

SBDDs  and  FNADDs  for  benchmark  functions. 


SBDD 

FNADD 

/ 

size 

width 

a 

BSQ3I 

a 

mm 

group 

5xpl 

90 

25 

2250 

mi 

n 

~cW 

bw 

116 

37 

4292 

m 

9 

34 

136 

9 

cA(h 

coni 

20 

5 

100 

13 

12 

25 

mm 

1.83 

c2q\ 

rd53 

25 

6 

150 

7 

14 

21 

mm. 

0.50 

C4Q2 

rd73 

45 

10 

450 

23 

30 

53 

318 

0.70 

W2Q\ 

xor5 

11 

2 

22 

5 

6 

11 

22 

0.83 

C4Q2 

Table  VI  compares  sizes  of  SBDDs  and  FNADDs  for  n- 
bit  adders.  In  this  example,  FNADDs  provide  the  reduc¬ 
tion  of  all  three  parameters,  depth,  size,  and  width  and 
the  reduction  is  considerable  compared  to  SBDDs.  The  re¬ 
duction  possibilities  in  these  DDs  are  compared  through 
the  percent  of  used  nodes  from  the  total  of  nodes  in  the 
corresponding  DTs.  It  may  be  seen  that  these  values  are 
comparable,  thus,  the  possibility  to  do  reduction  in  SBDDs 
and  FNADDs  is  comparable.  In  FNADDs,  the  reduction  of 
area  is  considerable  and  shows  advantages  of  them. 

Table  VII  compares  the  FNADDs  to  some  other  DDs 
based  on  spectral  transforms.  The  Arithmetic  transform 
DDs  (ACDDs),  Walsh  transform  DDs  (WDDs)  in  (1,-1) 
coding  [17],  and  Complex-Hadamard  Transform  DDs  [4] 
are  compared  with  FNADDs  for  some  benchmark  func¬ 
tions.  ACDDs,  WDDs,  and  CHTDDs  are  DDs  on  Abelian 
groups.  Therefore,  the  depth  is  equal  to  n.  We  do  not 
use  peculiar  properties  of  complex  spectra  for  switching 
functions  in  CHTDDs,  nor  FNADDs  to  further  reduce  the 
depth  of  these  DTs.  For  rd53  and  xor5,  the  sizes  of  FN¬ 
ADDs  are  equal  to  those  of  other  DDs.  However,  the  num¬ 
ber  of  non- terminal  nodes  and  the  width  are  reduced.  In 
other  cases  in  this  example,  FNADDs  are  more  efficient 
with  respect  to  depth,  width  and  size.  However,  as  for 
other  DDs,  some  examples  where  FNADDs  are  not  ef¬ 
ficient  certainly  can  be  found.  As  an  example  consider 
the  Achille’s  function  /  =  xxyx  V  x2y2  V  •••  V  x2ry2T< 
Table  VIII  shows  the  sizes  of  FNADDs  for  Achille’s  heal 
function  for  different  values  of  n  and  for  two  differ¬ 
ent  orderings  of  variables  &i ,  £2, . . .  f  #2r 1 2/1  >  S/2i  •  " » !/2r  and 
x\ ,  2/i  >  2/2,  *  •  •  ,  xnyn  .  This  example  shows  that,  as  in  any 
other  DDs,  the  sizes  of  FNADDs  greatly  depends  on  the 
initial  variables  ordering.  In  this  example,  the  depth  reduc¬ 
tion  in  FNADDs  is  achieved  at  the  price  of  the  increased 
size  of  FNADDs. 

6.  Closing  Remarks 

For  large  switching  functions, 

1.  Quaternion  groups  permit  considerable  reduction  of 
time  complexity  of  FFT  at  the  price  of  comparatively 
small  increase  of  space  complexity. 

2.  FNADDs  on  quaternion  groups  permit  depth  and 
width  reduction  simultaneously.  In  many  cases,  they 


TABLE  VIII 

BDDs  and  FNADDs  for  Achilles'  heal  functions. 


n  =  2r 

BDD 

FNADD 

worst 

best 

worst 

best 

decomposition 

4 

8 

6 

13 

12 

C2Q2 

6 

12 

8 

37 

25 

Q\ 

8 

14 

10 

71 

40 

C4QI 

have  smaller  size  than  DDs  on  dyadic  groups.  FN¬ 
ADDs  have  smaller  area  and  permit  reduction  of  the 
number  of  non-terminal  nodes  at  the  price  of  increased 
number  of  constant  nodes. 

3.  Both  properties  of  quaternion  groups  increase  when 
the  number  of  variables,  equivalently,  the  order  of  do¬ 
main  groups  increases. 

It  follows  that,  compared  to  dyadic  groups,  the  quaternion 
groups  have  advantages  as  domain  groups  for  large  switch¬ 
ing  functions.  Extensions  to  multiple- valued  functions,  are 
straightforward.  For  example,  the  quaternary  functions  are 
defined  on  C” ,  or  alternatively  on  the  quaternion  groups 
of  the  order  4n.  However,  unlike  Walsh  transform  on  C2i 
the  Fourier  transform  matrix  on  C\  involves  complex  unit 
i  and  unlike  the  Fourier  transform  matrix  on  Q2  does  not 
have  any  zero  element.  Therefore,  compared  to  ,  ad¬ 
vantages  of  Fourier  transform  on  quaternion  groups  over 
Fourier  transform  on  CJ  increase. 
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TABLE  VI 

SBDDs  and  FNADDs  for  adders  and  multipliers. 


adders  | 

SBDD 

| FNADD  1 

n 

size 

width 

% 

a 

ntn 

cn 

size 

width 

% 

a 

rn 

2 

8 

21 

22.00 

168 

4 

7 

11 

2 
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57 
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TABLE  VII 

Sizes  of  ACDDs,  WDDs,  CHTDDs  and  FNADDs. 


ACDD 

CHTDDs 

/ 

ntn 

cn 

size 

width 

_ « 

rn 

ntn 

cn 

size 

width 

a 

rn 

5xpl 

38 

11 

49 

3.45 

127 

128 

255 

64 

■fjtjvzyl 

0.99 

bw 

31 

32 

63 

■9 

31 

32 

63 

16 

0.96 

coni 

40 

5 

45 

mm 

1 

8 

115 

42 

157 

56 

8792 

2.74 

rd53 

15 

6 

21 

■9 

2.5 

15 

6 

21 

5 

105 

2.5 

rd73 

27 

6 

33 

6 

188 

4.5 

27 

8 

36 

7 

252 

3.37 

xor5 

15 

6 

21 

5 

2.5 

9 

2 

11 

2 

22 

4.5 

WDD(1,-1) 

FNADD  | 

/ 

ntn 

width 

« 

rn 

ntn 

cn 

a 

m 

5xpl 

37 

13 

IrU 

9 

■ 

2.85 

39 

BE  »1 

H 

MdiiiM 

n 

bw 

31 

32 

1 

16 

9 

iM 

34 

H 

136 

■El 

coni 

48 

11 

13 

767 

4.36 

13 

25 

6 

1.08 

rd53 

15 

5 

20 

5 

7 

R  9 

21 

63 

0.50 

rd73 

28 

5 

33 
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Abstract 

In  this  paper  we  present  a  new  axiomatization  of  the 
notion  of  entropy  of  functions  between  finite  sets  and 
we  introduce  and  axiomatize  the  notion  of  conditional 
entropy  between  functions .  The  results  can  be  directly 
applied  to  logic  functions ,  which  can  be  regarded  as  func¬ 
tions  between  finite  sets.  Our  axiomatizations  are  based 
on  properties  of  entropy  with  regard  to  operations  com¬ 
monly  applied  to  discrete  functions  and  are  related  to  the 
usage  of  entropy  as  a  measure  of  the  energy  dissipated 
by  circuits  that  implement  discrete  functions . 

Keywords:  functions  between  finite  sets,  logic  func¬ 
tions,  entropy,  axiomatization,  conditional  entropy. 

1  Introduction  and  Basic  Notations 

This  paper  is  concerned  with  the  axiomatization  of  the 
notion  of  entropy  for  finite  functions.  This  type  of 
numerical  characteristic  of  functions  has  been  shown 
(cf.[l])  to  be  related  to  the  complexity  of  realization 
of  boolean  functions.  It  has  also  been  noted  [3],  that 
power  dissipation  of  a  circuit  realizing  a  binary  function 
is  proportional  to  its  entropy.  Information  measures,  es¬ 
pecially  conditional  entropy  of  a  logic  function  and  its 
variables,  have  been  used  for  minimization  of  logic  func¬ 
tions  [4].  A  study  of  information  estimations  for  logic 
functions  can  be  found  in  [2].  Axiomatizations  of  en¬ 
tropies  for  random  variables  can  be  found  for  example 
in  [5].  The  axiomatization  of  Shannon  entropy  for  func¬ 
tions  considered  in  this  paper  makes  use  of  the  opera¬ 
tions  that  are  naturally  encountered  in  building  circuits 
(parallel  and  serial  connections  of  functional  modules) 
and  represents  a  simplification  of  some  of  our  previous 
work  [6].  Further,  we  extend  the  notion  of  conditional 
entropy  of  a  function  and  its  arguments  to  conditional 
entropy  between  functions  and  we  give  an  axiomatiza¬ 
tion  of  this  notion. 
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Unless  otherwise  specified,  all  sets  considered  in  the  fol¬ 
lowing  discussion  are  nonempty  and  finite.  All  loga¬ 
rithms  are  in  base  2.  In  the  discussion  we  assume  that 
0  •  logO  =  0.  The  domain  and  range  of  a  function  / 
are  denoted  by  Dorn  (/)  and  Ran(/)  respectively.  The 
restriction  of  function  /  to  a  set  A  C  Dom  (/)  is  de¬ 
noted  by  fA •  Below  we  introduce  some  operations  on 
functions  between  finite  sets. 

Let  J  be  the  class  of  all  functions  between  finite, 
nonempty  sets.  Define  the  partial  order  Q  on  J  by 
/  C  g  if  /  :  A  — ■>  B,  g  :  A'  — >  B',  A  C  A',  B  C  B' 
and  /(g)  —  g(p)  for  a  €  A.  Note  that  if  A,B,C,D  are 
finite  sets,  such  that  AC\  B  —  C  C\  D  =  f  :  A  >  B , 
and  g  :  C  — ►  D,  then  there  exists  sup {/,  p}-  We  define 
the  function  /  U  g  =  sup{/,#}  by  setting 

/ ,  w  v  f  fix)  if  x  €  A 

</u»x.)-|jy  axeB 

It  is  easily  verifiable,  that,  whenever  it  is  defined,  the 
“LI”  operation  is  commutative  and  associative. 

The  composition  of  functions  /  :  A  ->  B  and  g  :  B  ->  C 
is  a  function  gf  :  A  C,  such  that  gf(x)  =  g{f(x))> 
for  every  x  €  A. 

The  Cartesian  product  of  functions  /  :  A  -*  B  and 
g  :  C  ->  D  is  a  function  /  x  g  :  Ax  C  -» 1?  x  D  defined 
as  (/  x  g)(x}y)  =  (f(x),g(y)),  for  every  x  €  A  and 

y  €  C. 

In  the  following  sections  we  will  also  use  the  following 
definition: 

Definition  1.1  Let  Au  M,- .  •  ,  An  be  n  pairwise  dis¬ 
joint  subsets  of  a  finite  set  A.  The  generalized  charac¬ 
teristic  function  of  sets  Aj,  A2,  • .  *  >  An  is  the  function: 
XAirAt,... :  Ai  UA2 U . .  -U  An  -»•  {1, 2, . . .  , n}  defined 
by 

Xaua2,...,a„(x)  =  i 
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if  and  only  if  x  E  Ai,  for  1  <  i  <  n.  □ 

Note,  that  the  generalized  characteristic  function  is  also 
a  function  between  finite  sets. 

2  Axiomatic  definition  of  functional  en¬ 
tropy 

In  this  section  we  introduce  an  axiomatic  definition  of 
entropy  of  a  function  between  finite  sets. 

Definition  2.1  Let  T  be  the  class  of  all  functions  be¬ 
tween  finite,  nonempty  sets,  and  H  :  T  -»  R  be  a  func¬ 
tion  assigning  a  real  number  to  every  /  E  T.  Function 
H  is  called  entropy  of  functions  between  finite  sets  if 
and  only  if  it  satisfies  the  following  axioms: 

(El)  H(fa)  =  i/(/),  for  every  function  /  :  A  -¥  B 
and  bijection  a  :  A!  -*  A. 

(E2)  H(gf )  <  H(f ),  for  every  /  :  A  -»  B  and  g  :  B 
C. 

(E3)  If  A ,  C  are  finite  sets  with  \A\  <  \C\,  a  :  A  B 
and  0  :  C  -*  D  are  bijections,  then  H(a)  <  H{0). 

(E4)  If  /  :  A  -»  B  and  f  :  C  -¥  D  are  functions 
between  finite  sets  and  AnC  =  B  H  D  =  0,  then 

H('U9)  =  Wc\H^  +  +  H <*«>• 

(E5)  H (f  x  g)  =  H(f)  +  H(g)  for  all  functions  f  :  A 
1 3  and  g  :  C  -4  D. 

□ 

We  now  prove  several  consequences  of  the  above  axioms. 

Lemma  2.2  If  f  :  A  f(A)  C  B  and  /'  :  A  -4  By 
such  that  f(x)  =  /'(x)  for  every  x  E  A,  then  H(f)  = 

H(f’). 


Lemma  2.3  If  f  :  A  -¥  B  is  a  function  and  0  :  B  -4 
B '  is  a  bijection ,  then  H(0f)  =  H{f). 

Proof.  Note  that  /  =  0~l(0f).  Using  (E2)  we  obtain 
H(f)  =  <  H(f3f)  and  H{pf)  <  H(f), 

which  gives  H(0f)  =  H(f).  I 

Lemma  2.4  The  entropy  of  a  constant  function  is  0. 

Proof.  Let  /  :  A  B  be  a  constant  function.  Choose 
a  set  C  and  a  constant  function  g  :  C  D,  such  that 
\C\  =  |j4|  and  An  C  =  B  fl  D  =  0.  Notice  that  there 
exists  a  bijection  7,  such  that  /  U  g  =  7 xa  c,  so  H (/  U 
9)  =  H(Xa,c).  FVom  (E4)  we  ha veH(fUg)  =  \H{})  + 
\H(g)  +  H(xa,c)>  which  gives  H(f)  +  H(g )  =  0.  But, 
since  \A\  =  \C\  and  \B\  =  \D\  =  1,  there  exist  bijections 
a  and  0,  such  that  g  =  0 /a.  Using  (El)  and  lemma 
2.3  we  get  H(g)  -  (/),  so  H(f)  =  0.  I 

Lemma  2.5  The  entropy  of  a  bijection  depends  solely 
on  the  cardinality  of  its  domain. 

Proof.  Let  /  :  A  B  and  g  :  C  -4  D  be  any  bijections, 
such  that  \A\  =  |C7|.  Since  /  and  g  are  bijections,  we 
have  |I?|  =  |D|.  Consequently,  there  exist  two  bijections 
a  :  A  C  and  0  :  D  B  such  that  /  =  0ga.  By 
axiom  (El)  and  Lemma  2.3  we  get  H(f)  =  H(g).  I 

Denote  by  an  a  bijection  for  which  |Dom  (an)|  =  n.  A 
consequence  of  the  above  lemma  is  the  existence  of  a 
function  :  N  -4  E  from  the  set  of  natural  numbers 
into  the  set  of  real  numbers,  such  that  hjv(n)  —  H(an). 

Lemma  2.6  Let  f  :  A  B  be  a  function  between  two 
finite  sets  and  let  B  =  {&i, ...  ,  6m).  The  entropy  of  the 
function  f  is  given  by 

i= i  ‘  ’ 


Proof.  Let  g\  :  f(A)  B  and  g2  :  B  ->  f(A)  be  func¬ 
tions,  such  that  </i(x)  =  x,  for  all  x  E  f(A)  and  gz(x)  = 
x,  if  x  E  f(A)  and  52^)  =  a,  for  x  $  f(A ),  where  a  is 
any  element  of  f(A ).  Clearly,  /  =  g2f(  and  f  =  gif. 
Using  (E2)  we  obtain  H(f)  =  H{g2fl)  <  H(f)  and 
H(f)  =  H(gif)  <  H(f),  which  gives  H(f)  =  H(f').  I 

It  follows  from  the  above  lemma,  that  only  surjective 
functions  need  to  be  considered.  In  the  discussion  below 
we  assume  all  functions  to  be  surjective,  unless  other¬ 
wise  specified. 


where  Aj  =  {a  E  A\f(a)  =  bj}  =  f  1(bj)  for  1  <  j  < 
m. 


Proof.  Consider  functions  f\  :  C\  ->  D\  and  f2  -  C2  ^ 
D2 ,  where  C\  fl  C2  =  D\  O  D2  =  0  also,  let  /  =  f\  LJ  f2 
and  /  :  C  -*  D,  C  =  C\  U C2,  D  =  D\  U L>2.  Denote  by 
/1  :  C  -4  Di  U  {d},  d  E  D2j  a  function  such  that 


if  x  E  Ci 
if  x  E  (72 
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It  follows  from  (E4)  that: 

H(fi)  =  +  ^H{fd)  +  H(xcuci) 

where  fd  is  the  constant  function  fd(x)  =  d  for  a;  E  (72. 

Applying  (E4)  to  /  and  substituting  the  above  equa¬ 
tion,  we  obtain: 

H(f)  =  H{h)  +  (1) 

Let  f  :  A  — y  B,  B  —  {&i5 . .  •  5  bm }  and  A{  —  {x  E 
A\f(x)  =  b{}.  The  function  /  can  be  decomposed  using 
the  “LI”  operation  and  constant  functions  as  /  =  Ja1  U 
*  *  *  LI  f Am  •  Also,  let  a**)  :  Aj  -»  Ei  be  a  bijection  onto 
some  set  E*,  F*  D  £  =  0,  and  E{  n  Ej  =  0  for  every 
i  ^  j.  Consider  a  sequence  of  functions  Fo,  F\ , . . .  ,  Fm 
such  that 

Fo  =  f,  Fj  =  U  ...  U  a«>  U  /,»i+1  U  . . .  U  fAm 
for  j  >  0. 

Clearly,  Fm  :  A  ->  (J  Ei  is  a  bijection.  By  substituting 
fAl  □  *  •  •  U  j  Am  and  into  formula  (1)  in  place  of  f\ 
and  /2  respectively,  we  obtain: 

H(F0)  =  H(F1)-lj^H(aU). 

The  same  procedure  is  then  applied  iteratively  to 
if(Fi),if(F2),...  ,if(Fm).  E.g.,  to  decompose  H{Fj) 
we  substitute  U  •  •  •  U  U  fAj+1  LJ . . .  U  fAm  and 
a(i+i)  in  place  of  f\  and  /2  respectively,  into  formula 
(1).  By  applying  this  procedure  m  times,  the  following 
equation  is  obtained: 

H(f)  =  H(F0)  =  H(Fm)-jr^H(aW) 

j- 1 

=  Mm)-E  mtMMjD- 

j= 1  1  1 

I 

Lemma  2.7  hj^(n)  =  rlog(n),  where  r  E  M,  r  >  0  is  a 
constant 

Proof.  Let  a*  =  an  x  •••  x  an.  Let  us  prove  that 

- - * - - 

k  times 

if  (a £)  =  kH(an ),  for  k  >  1.  If  k  =  1  the  case  is  trivial. 


Suppose,  that  the  hypothesis  holds  for  some  k  >  1,  we 
have  if  (a*+1)  =  H(ak  x  an),  from  (E5)  it  follows  that 
H(ak+l)  =  H{akn)  +  H{an)  =  (k  +  l)if(an),  which 
proves  the  hypothesis. 

Since  a  Cartesian  product  of  bijections  is  also  a  bijec¬ 
tion  we  have  hjv(nk)  =  fc/i;v(n).  On  the  other  hand, 
it  follows  from  (E3),  that  hw  is  nondecreasing.  It  is  a 
functional  result  that  a  function  hjv  :  N  -»  R,  satisfying 
the  two  conditions  has  the  form: 

hN(n)  =  rlog(n), 

where  r  E  R,  r  >  0  is  a  constant.  A  proof  can  be  found, 
for  example,  in  [5].  I 

Later  on,  we  will  explicitly  assume  r  =  1.  This  could 
also  be  achieved  by  introducing  an  additional  normal¬ 
ization  condition  if(a2)  =  1. 

We  are  now  ready  to  prove  the  following  theorem: 


Theorem  2.8  The  entropy  of  a  function  f  :  A  -¥  B 
between  two  finite  sets,  B  =  {61, . . .  ,  6m}  has  the  form : 


w-tm  ><* 

i~  1  1  ' 


\Ai\ 

\A\  ' 


where  A;  ~  {a  E  A\f(a)  =  bA. 


Proof.  By  applying  Lemma  2.6  to  the  function  /  and 
using  Lemma  2.7  we  get 


H(f)  =  logl^l-E^logMM 


3  Axiomatic  definition  of  conditional 
entropy 

In  this  section  we  introduce  an  axiomatic  definition  of 
the  conditional  entropy  between  functions. 

Definition  3.1  Let  T  be  the  class  of  all  functions  be¬ 
tween  finite,  nonempty  sets.  Denote  by  V  a  class  of  pairs 
of  functions  having  identical  domains  V  =  {(/,#)  E 
T  x  F|Dom  (/)  =  Dom  ( g )}.  Let  He  :  V  R  be  a 
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function  assigning  a  real  number  to  every  pair  of  func¬ 
tions  (/,#)  €  V.  Hcif^g)  will  be  denoted  below  as 
H(f\g).  Function  Hc{f>g)  is  called  conditional  entropy 
of  a  pair  of  functions  (f,g)  if  and  only  if  it  satisfies  the 
following  axioms: 


Repeating  further  the  above  process  we  get 


(CEl)  H(f\c)  =  H(f)  for  every  constant  function  c. 

(CE2)  H(f\9l  U g2)  =  lj ^H(fAl\9l)  +  l$H(fA2\g2), 
for  all  /  :  A  — >  B,  gi  :  A\  — >  Ci,  02  *  A2  — >  C2, 
A\  O  A2  =  Ci  D  C2  =  0,  Ai  U  A2  =  A. 

□ 


We  now  examine  some  of  the  properties  of  conditional 
entropy  defined  above. 

Theorem  3.3  H(f\f)  —  0,  for  every  function  between 
finite  sets  f  :  A->  B,  B  =  {61? . . .  ,  6m}. 

Proof.  From  Theorem  3.2  we  obtain: 


Theorem  3.2  Conditional  entropy  between  functions 
f  :  A  B  and  g  :  A  C, where  B  =  {61,...  = 

{ci,...  ,cn},  has  the  following  form: 


where  A )  =  {a:  €  A|/(x)  =  b{  A  </(x)  =  cj),  Aj  =  {a:  € 
A\g(x)  =  Cj}. 


Proof.  We  decompose  g  as  g  =  g^l  U  . . .  LJ  g^n ,  where 
Pa4i ,  •  •  •  ,  gAn  are  constant  functions.  Let  A!  =  A2  U. .  .U 
An .  Using  (CE2)  we  get 


H(f\g)  =  l^H(fAl\gAl)  + 

Since  gAl  is  a  constant,  using  (CEl)  and  Theorem  2.8 
we  get 


mm  = 


^mfA1)  +  ^H(fA'\gA') 


_  1*1  v*  I4| . 


mt 


|A|‘S|*| l0g  \a\\  +  \a\HUa’\9a') 


L  \A\  g|Ax|  +  \A\H^a'\ 9a') 


Let  A"  =  A3  U  . . .  U  A„.  Applying  the  above  procedure 
to  H(fA>\gA>)  yields 


H(f\g) 


h  14  141 

+^j^H{fAn\gA„) 


vMi 

h  mi 


*(/i/)  =  -££^,  (2) 

where  A)  =  {x  e  A\f(x)  =  A  /(x)  =  bj],  Aj  =  {x  € 
A\f(x)  =  fy}.  Since 


4*. 


f  0  if i  #  j 

\  Aj  if  i  =  j, 


by  substituting  the  above  into  equation  (2)  we  obtain 
the  desired  result.  | 

Consider  the  functions  f  :  A  B  and  g  :  yt  (7, 
where  2?  =  {61, . . .  , 6m}  and  C  =  {ci, . . .  , cn}.  Let  us 
denote  by  (f\g)  :  A  -»  B  x  C  the  function  defined  as 
(/;p)(®)  =  (/(a;), 0(a))  for  every  a  G  A. 


Theorem  3.4  We  have  H(f;g)  =  H(f\g)  +  #(0)  = 


Proof.  Let  us  prove  the  first  part  of  the  equality.  We 
have: 


fT(r  \  ST  1^51 1  l^jl 

=  ~  £  uTloe-rtT 


Ml  14 


=  -YY  LIHiogMil  Mai 

fee  Ml  s  1*1  Ml 

=  f  MiilogMii 

Uk  Ml  1  g  1*1 

=  ir(/|p)  +  jy(p), 
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where  A J  —  {x  €  A|/(x)  —  bi  A  <?(x)  =  Cj},  Aj  = 
{x  €  A|0(x)  =  Cj}.  The  proof  of  the  equality  H(f;g)  = 
H{g\f)  +  H(f)  is  analogous.  I 

Note  that  the  above  two  theorems  correspond  to  well 
known  theorems  about  Shannon  entropy  of  random  vari¬ 
ables,  where  H(f;g)  corresponds  to  the  joint  Shannon 
entropy  of  random  variables. 

Also,  our  definition  of  conditional  entropy  between  two 
functions  is  a  generalization  of  the  notion  of  conditional 
entropy  between  a  multiple-valued  logic  function  and 
one  of  its  variables,  which  was  introduced  in  [2]. 

Let  . . .  ,  A M  and  B  be  finite  sets  and  A  =  A W  x 
...  x  A^n\  We  denote  by  7r *  the  projection  of  the  set 
A  onto  A^\  Let  f(x i,...  ,xn)  :  A  5,  where  x*  € 
A W  be  a  multiple-valued  logic  function.  The  entropy 
of  function  /  conditioned  on  one  of  its  variables,  say 
Xi  €  A^\  is  defined  as  H(f\ni).  If  B  =  {61,...  ,6m}, 
and  A^  =  {ai, . . .  ,an}  we  have: 


J— 1  X—  1  11  1  J 


where  Aj  =  {x  €  A|/(x)  =  bi  A  Ki(x)  =  Oj},  A/  =  {xG 
A|7Ti(x)  =  aj},  which  corresponds  to  the  definition  given 
in  [2]. 


4  Conclusion 

Axiomatic  definitions  of  entropy  of  functions  between 
finite  sets  and  of  conditional  entropies  of  such  functions 
have  been  presented.  In  the  axioms  we  used  standard 
operations  applied  to  discrete  functions.  We  believe, 
that  the  paper  will  contribute  to  a  better  understand¬ 
ing  of  the  notion  of  entropy  of  functions  between  finite 
sets.  We  also  hope  that  our  results  will  find  applica¬ 
tions  in  decomposition  of  MVL  logic  functions  and  in 
the  estimation  of  power  dissipation  of  digital  circuits. 
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Abstract 

This  paper  presents  a  design  of  a  non-volatile  multiple¬ 
valued  content-addressable  memory  (MVCAM)  using 
metal-ferroelectric-semiconductor  (MFS)  FETs.  An 
MFSFET  is  an  important  device  with  a  non- destructive 
read  scheme.  Multiple-valued  stored  data  are  directly 
represented  by  remnant  polarization  states  that  corre¬ 
spond  to  threshold  voltages  of  an  MFSFET.  Since  one¬ 
digit  comparison  between  multiple-valued  input  and 
stored  data  is  performed  by  the  combination  of  two  dif¬ 
ferent  threshold  operations,  a  one- digit  comparator  can 
be  designed  by  two  MFSFETs.  The  use  of  the  one¬ 
digit  comparator  makes  it  possible  to  design  a  compact 
MVCAM  cell  It  is  evaluated  that  the  performance  of 
the  proposed  MVCAM  is  superior  to  that  of  some  bi¬ 
nary  and  multiple-valued  CAMs  in  terms  of  bit  density, 
peripheral- circuit  complexity,  access  speed,  and  func¬ 
tionality. 

1.  Introduction 

Real-time  programmable  and  non-volatile  memories 
with  a  one-transistor  cell  structure  are  one  of  the  key 
modules  in  the  present  high-performance  system  LSIs. 
Ferroelectric(FE)  memory  devices  have  increasingly  at¬ 
tracted  attention  because  they  are  non-volatile  mem¬ 
ories  with  the  capability  of  high-speed  read  and  write 
operations.  For  example,  FE  capacitors  are  about  ten- 
times  larger  than  that  of  the  Si02  film,  so  that  they 
have  a  potential  advantage  to  implement  high-density 
memories.  A  one-transistor  memory  cell  can  be  imple¬ 
mented  by  only  a  single  MFSFET  with  a  non- destructive 
read  operation. 

It  is  well  known  that,  in  the  present  giga-scale  system- 
on-a-chip  integration  era,  accelerating  the  evolution  in 
chip  density  causes  a  communication  bottleneck  be¬ 
tween  memory  and  logic  modules.  A  logic-in-memory 
structure,  in  which  storage  functions  are  distributed 
over  a  logic-circuit  plane,  is  a  key  technology  to  solve 


the  above  problem.  CAM  is  a  typical  application  of 
such  a  logic-in-memory  VLSI  architecture.  Until  now, 
several  high-performance  CAMs  have  been  reported 
[l]-[6].  However,  it  has  been  difficult  to  solve  the  trade¬ 
off  between  real-time  programmability  and  non¬ 
volatility  with  keeping  a  compact  CAM  cell  circuit.  A 
few  challenging  works,  a  MVCAM  using  FE  devices, 
have  been  reported[6]. 

In  this  paper,  a  new  design  of  an  MVCAM  using 
non-destructive  FE  devices  such  as  MFSFETs  is  pro¬ 
posed.  Multiple- valued  stored  data  in  the  MVCAM 
cell  are  directly  represented  by  threshold  voltages  of 
an  MFSFET  that  can  be  programmed  by  controlling 
remnant  polarization  states.  Multiple- valued  one-digit 
comparison  is  performed  by  the  logical  product  AND 
of  two  different  threshold  operations.  Since  the  use  of 
precharge- evaluate  logic  makes  it  possible  to  implement 
a  two-input  AND  gate  by  just  wiring,  an  MVCAM  cell 
can  be  designed  by  two  MFSFETs.  An  MVCAM  word 
circuit  is  also  designed  by  series  connection  of  the  cell 
circuits  together  with  precharge-evaluate  logic,  so  that 
an  n-digit  MVCAM  word  circuit  can  be  realized  by  2 n 
MFSFETs.  Moreover,  the  performance  of  the  proposed 
MVCAM  is  compared  with  that  of  a  binary  dynamic 
CAM[2]  and  non-volatile  M  VC  AMs  [3],  [5],  [6]  in  terms 
of  non-volatility,  real-time  programmability,  a  bit  den¬ 
sity,  an  execution  speed,  and  peripheral- circuit  simplic¬ 
ity.  As  a  result,  it  is  demonstrated  that  its  performance 
is  superior  to  that  of  the  other  CAMs. 

2.  Review  of  MFSFETs 

2.1  Characteristics  of  MFSFETs 

An  MFSFET  structure  is  similar  to  a  conventional 
metal-oxide-semiconductor(MOS)  FET  with  the  excep¬ 
tion  that  the  gate  insulating  layer  is  replaced  by  an 
active  ferroelectric  thin  film.  Figure  1  shows  a  cross- 
sectional  view  and  a  symbol  of  an  MFSFET.  An  FE 
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Figure  1.  MFSFET  structure,  (a)  Cross-sectional  view, 
(b)  Symbol. 


Figure  2.  Perovskyte  structure. 

material  consists  of  a  Perovskyte  structure  as  shown  in 
Figure  2,  which  is  transformed  when  an  atom  at  the 
center  of  a  lattice  causes  a  polarization  by  a  shift  from 
an  electrically  neutral  state  according  to  the  external 
electric  field  which  is  determined  by  an  applied  voltage 
and  thickness  of  FE  thin  films.  This  characteristic  is 
represented  by  a  hysteresis  loop.  Figure  3  shows  po¬ 
larization  vs  voltage  hysteresis  loops  of  the  FE  film. 
One  of  the  most  important  characteristics  in  the  FE 
material  is  to  have  a  remnant  polarization  within  the 
saturated  hysteresis  loop  when  the  voltage  across  the 
FE  film  is  zero.  It  implies  that  the  FE  film  can  pre¬ 
serve  information  that  corresponds  to  a  remnant  po¬ 
larization  state  without  voltage  supply,  i.e.  an  MFS¬ 
FET  is  capable  of  a  non-volatile  memory  device.  Re¬ 
cently,  MFSFETs  have  been  reported  using  the  variety 
of  the  FE  material  such  as  Bi4Ti30x2[7],  LiNb03[8], 
and  SrBi2Ta209[9]. 

Figure  3  also  shows  an  example  of  a  binary  data  as¬ 
signment  with  remnant  polarization  states.  As  the 
remnant  polarization  is  determined  by  an  applied  volt¬ 
age  across  the  FE  film,  whose  magnitude  is  larger  than 
the  coercive  voltage  Vc,  a  logic  state,  “1”  or  “0”,  can 
be  written  into  the  MFSFET  by  the  gate  voltage  Vg 
and  the  bulk  voltage  Vb-  The  required  voltage  applied 
to  gate  electrode  of  the  MFSFET  to  write  data  is  lower 


Figure  3.  Hysteresis  loop  characteristics  of  an  FE  mate¬ 
rial. 


Figure  4.  Compensation  charge  in  the  semiconductor 
surface. 

than  that  of  conventional  non-volatile  memory  devices 
such  as  a  floating-gate  MOS  transistor. 

When  the  external  voltage  is  removed,  the  remnant 
polarization  will  induce  an  electric  field  which  attracts 
positive  (or  negative)  compensation  charge  to  the  semi¬ 
conductor  surface  as  shown  in  Figure  4.  Therefore,  the 
carrier  density  of  a  semiconductor  at  the  interface  will 
be  inverted,  which  makes  it  possible  to  shift  the  thresh¬ 
old  voltage.  Figure  5  shows  the  threshold  voltage  corre¬ 
sponding  to  the  MFSFET  state  “0”  and  “1”.  In  a  read 
scheme,  Vb  is  fixed  to  the  ground  level.  When  read 
gate  voltage  V rf  is  applied  to  Vg,  the  switching  state 
in  the  MFSFET  depends  on  the  polarization  state  in 
the  FE  film,  i.e.  stored  data.  Therefore,  stored  data  in 
the  MFSFET  is  read  as  “on”  and  “off”  states.  As  Vrf 
is  low  enough,  the  polarizations  state  in  the  FE  film 
remains  the  previous  state.  Therefore,  the  MFSFET 
can  realize  a  non-destructive  read  operation. 

Consequently,  an  MFSFET  is  capable  of  being  used 
as  a  non-volatile  memory  device  with  a  non-destructive 
read  scheme.  Principle  of  the  operation  is  similar  to  a 
Floating-Gate  MOS  Transistor,  but  an  MFSFET  has 
the  advantage  of  a  high-speed  and  a  low  power  write 
scheme  due  to  the  characteristic  of  FE  films. 

2.2  Multi-level  threshold  programming  in 
MFSFETs 

In  an  MFSFET,  a  stored  value  is  represented  by 
remnant  polarization  states.  Therefore,  several  rem- 
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Figure  5.  Relationship  between  threshold  voltage  and  the 
remnant  polarization  states. 

nant  polarization  states  are  required  for  multiple- valued 
data  storage.  For  example,  Figure  6  shows  the  rela¬ 
tionship  between  four  remnant  polarization  states  and 
the  assignment  of  four-valued  data.  Four  remnant  po¬ 
larization  states  are  obtained  by  the  following  method. 
The  polarization  in  an  FE  film  moves  on  branches  as 
shown  in  Figure  4.  This  means  that  Pr o  and  —  Pro  are 
produced  by  the  transition  of  the  polarization  through 
a  — >  b  — ►  c,  and  d  — *  e  — ►  /,  respectively. 

Consider  the  storage  of  “2”  in  the  four- valued  logic 
into  MFSFETs.  Figure  7  shows  the  timing  diagram 
and  polarization  states  for  a  write  scheme.  First,  Vr\ 
is  applied  to  the  gate  electrode  of  the  MFSFET,  and 
then  zero  is  applied.  At  that  moment,  the  polarization 
stays  on  a.  Second,  —Vro  is  applied,  and  then  zero  is 
applied.  The  polarization  changes  from  a  to  c  through 
b.  Consequently,  the  remnant  polarization  in  FE  films 
is  Pr0,  i.e.  four- valued  data  “2”  is  stored. 

Threshold  voltages  are  shifted  depending  on  rem¬ 
nant  polarization  states.  If  there  are  four  remnant  po¬ 
larization  states,  we  can  utilize  the  four  level  threshold 
voltages.  This  characteristic  is  important  to  design  a 
CAM  cell  structure  using  MFSFETs. 


Figure  6.  Four  remnant  polarizations  states. 


Figure  7.  Timing  diagram  and  the  polarization  state  for 
a  write  scheme  of  a  multiple- valued  logic  value. 

3.  Design  of  a  Multiple-Valued  Content- 
Addressable-Memory  Using  MFSFETs 

CAM  accomplishes  a  magnitude  comparison  between 
two  kinds  of  R- valued  input  words,  5(an  n-digit  ex¬ 
ternal  input  word)  and  H2( an  ith  n-digit  stored  input 
word),  and  generates  a  binary  output  word,  Z(m- bit 
output)  as  its  comparison  result.  Figure  8  shows  a  gen¬ 
eral  structure  of  an  MVCAM.  In  the  case  of  i?-valued 
encoding,  an  input  word  5  and  the  xth  stored  word  Bi 
are  expressed  as  follows: 


n 


5  = 

j=  1 

(i) 

and 

Bi  =  Y,Rn~j -bij 

3= 1 

(2) 

where  sj  and  b{j(  1  <  j  <  n)  indicate  the  jih  digits  of 
S  and  jB;,  respectively,  and  $j ,  bij  €  {0, 1,  •  •  ■ ,  R  -  1}. 
si  and  bn  are  the  most  significant  digits,  and  sn  and 
bin  are  the  least  significant  digits,  respectively.  The 
magnitude  compassion  between  5  and  Bi  is  defined  as 

=  {  l  otherwise.'  <3> 

The  ith  binary  value  z2  in  Z  is  equal  to  G(S,  Bi). 

In  the  following  subsection,  we  discuss  about  a  hard¬ 
ware  algorithm  of  the  magnitude  comparison,  and  a 
circuit  design  for  a  high-performance  MVCAM  using 
MFSFETs. 

3.1  Hardware  algorithm 

The  magnitude  comparison  G(S,  Bi)  is  represented 
by  two  kinds  of  threshold  operations  for  each  digit.  One 
is  the  greater-than  search  operation,  gj ,  and  the  other 
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For  example,  let’s  consider  the  magnitude  comparison 
between  four- valued  three-digit  words,  S  and  B,  as 

S  =  (sis2s3)  —  (321), 

B  =  (hb  2h)  =  (320). 

If  B  is  greater  than  5,  then  one  of  the  following  condi¬ 
tions: 

(«)  9i  =  1, 

(b)  gel  A  g2  =  1, 
and  (c)  gex  A  ge2  A  g3  =  1 

must  be  at  least  satisfied.  In  this  example,  the  above 
condition  (c)  is  satisfied  because  of  ( Si  =  fq),  (s2  =  b2) 
and  (53  <63)- 

In  general,  the  magnitude  comparison  between  n- 
digit  words  can  be  represented  by  using  gj  and  gej  as 

G(S,Bi)  =  gi  V  (gex  A  g2)  V  (gei  A  ge2  A  ge3)  ■  •  ■ 

•  •  •  V  (ge  1  Age2A---  A  gen-i  A gn). (6) 


(b) 

Figure  9.  MFSFET-based  pass  gate,  (a)Model, 
(b)  Example. 


formed  into 


G(S,Bi )  =  gi  V  gel  A(g2  V  ge2  A(— 

■  -  -  (flf„_!  V  gen-i  A  flf„)  •  •  ■))■  (7) 

We  use  a  pass-transistor  net  work  [4],  [5]  to  design  the 
CAM  circuit  based  on  Eq.(7).  In  this  concept,  a  func¬ 
tion  f(x,y)  which  has  two  kinds  of  R- valued  inputs  x, 
y ,  and  binary  output  are  considered  as  a  pass  gate  as 
shown  in  Figure  9(a).  This  pass  gate  is  defined  as 


ffay) 


1 

0 


A  is  connected  to  B, 
otherwise. 


(8) 


The  result  of  f(x,y)  is  obtained  by  the  voltage  of  A 
which  is  precharged  to  Vm  before  operation.  Where, 
B  is  connected  to  the  ground  level.  The  voltage  of 
A  depends  on  the  result  of  /(x,  y).  That  is,  the  ML 
is  discharged  if  f(x,y)  results  in  “1”.  Otherwise,  the 
pass  to  discharge  is  broken  and  the  A  still  remains  Vm. 

Two  binary  logic  operators,  AND  and  OR,  can  be 
easily  realized  by  parallel  and  serial  connections  of  pass 
gates.  Figure  9(b)  shows  one  example  of  realization  of 
9\  A  ge2  V  g2  using  pass-gates  which  correspond  to  #i, 
<je2,  and  g2l  respectively.  Consequently,  G{S,B{ )  in 
Eq.(7)  is  realized  by  combination  of  gates.  Figure  10 
shows  a  block  diagram  of  the  MVCAM  word  circuit, 
in  which  an  MVCAM  cell  consists  of  pass  gates  which 
correspond  to  gj  and  ge j,  respectively. 


where  symbols  V  and  A  indicate  binary  logic  opera¬ 
tions,  OR  and  AND,  respectively.  Eq.(6)  is  also  trans¬ 


3.2  CAM  cell  circuit  design  using  MFSFETs 

Two  kinds  of  pass  gates,  gj  and  gej ,  can  be  designed 
using  threshold  operations[10]  between  the  threshold 


Si  S2  Sn-1 


Figure  10.  Block  diagram  of  an  MVCAM  word  circuit. 


ML:  Match  line 

Figure  11.  Realization  of  gj  and  gej  using  MFSFETs 

voltage  Vth  and  the  gate  voltage  Vg  of  an  MFSFET. 
The  threshold  voltage  of  the  MFSFET  represents  the 
storage  data.  This  means  that  the  MFSFET  can  be 
considered  to  a  pass  gate  which  has  two  kinds  of  input 
V<7  and  Vt#5  described  in  section  3.1.  An  external 
input  S{  and  a  stored  input  bij  correspond  to  Vg  and 
Vth  as  shown  in  Figure  11,  respectively. 

The  function  which  is  represented  by  the  pass  gate 
which  consists  of  an  MFSFET  is  determined  by  rela¬ 
tionship  between  s;  and  bij  (that  is,  Vg  and  Vth)- 
Figure  12  shows  relationship  of  the  gate  and  thresh¬ 
old  voltage  which  represents  Si  and  bij  with  regard  of 
gj  and  gej  in  the  case  of  four-valued  logic.  Using  this 
MFSFET,  the  CAM  can  be  simply  designed. 

3.3  Overall  structure  of  an  MVCAM 

Figure  13  shows  the  circuit  diagram  of  a  magni¬ 
tude  comparator  between  n-digit  words.  In  this  cir¬ 
cuit,  precharge-evaluate  logic  is  employed  to  provide 
low  power  dissipation  and  high  speed  processing  with 
less  area  overhead.  When  the  clock  0  is  in  a  low  state, 
the  match  line  is  precharged  to  Vm.  When  (j)  goes  to  a 
high  state,  the  match  line  is  discharged  depending  on 
the  relationship  between  5  and  Bi.  Data  input  for  each 
cells  are  performed  in  parallel,  and  result  of  compari¬ 
son  are  generated  to  za,  simultaneously.  Hence,  this 
MVCAM  accomplishes  a  read  scheme  in  one  step. 

Figure  14  shows  circuit  diagrams  for  write  opera- 


Gate  voltage  Vg 


Figure  12.  Four- valued  threshold  voltages  and  gate  volt¬ 
ages  of  MFSFETs. 


Vm 


tions  in  the  magnitude  comparator.  To  discharge  Vd 
and  Vs  for  each  MFSFETs,  the  voltage  “3”  is  applied  to 
MFSFETs  which  correspond  to  gj.  Then,  the  thresh¬ 
old  voltage  Vth  is  programmed  by  Vg  and  V&  as  shown 
in  Figure  7.  To  avoid  changing  the  threshold  voltage 
of  every  non-selected  MFSFETs,  the  digit  line  or  the 
plate  line  which  connects  with  those  MFSFETs  main¬ 
tains  floating. 

Table  1  summarizes  an  estimated  performance  of 
CAMs.  MFSFETs  can  realize  real-time  programmable 
and  non-volatile  memories  with  a  non-destructive  read 
scheme,  so  that  the  data  does  not  require  a  repro¬ 
gram  scheme.  Therefore,  its  peripheral  circuit  for  a 
reprogram  operation  can  be  reduced.  Moreover,  since 
an  MFSFET  can  perform  a  threshold  operation  be¬ 
tween  Vg  and  Vth  which  correspond  to  Sj  and 
respectively,  a  CAM  cell  is  designed  by  two  MFSFETs 
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Table  1.  Comparison  of  CAMs. 


Vrl 


Floating 


Figure  14.  Write  operation  in  a  digit-parallel  structure. 


based  on  pass-transistor  network.  Consequently,  the 
presented  MVCAM  can  achieve  the  high-speed  access 
scheme  and  high  density  . 

4.  Conclusion 

A  compact  MVCAM  with  real-time  programmabil¬ 
ity  has  been  designed  by  using  MFSFETs.  Since  a 
multiple- valued  threshold  operation  is  performed  by 
using  a  single  MFSFET  whose  multi-level  threshold 
voltages  correspond  to  remnant  polarization  states,  a 
multiple-valued  one-digit  comparator  can  be  designed 
by  just  two  MFSFETs.  Moreover,  the  use  of  the  pre¬ 
charge- evaluate  logic  circuit  makes  an  MVCAM  word 
circuit  compact. 

As  a  future  research  target,  it  is  also  important  to 
develop  high-performance  VLSI  circuit  with  multiple¬ 
valued  external  inputs,  multiple- valued  stored  inputs 
and  binary  outputs  using  MFSFETs. 
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Abstract 

Contributions  to  multiple-valued  threshold  logic  in  the 
seventies  are  reviewed  at  the  light  of  developments  in  the  area 
of  artificial  neural  networks .  It  is  shown  that  it  is  possible  to 
adapt  methods  of  design  of  feedforward  neural  networks  to 
generate  networks  of  multiple-valued  neurons  to  realize  any 
multiple-valued  function . 


1.  Introduction 

The  rediscovery  of  (and  the  publicity  given  to)  the  gradient 
descend  algorithm  to  tune  networks  of  especial  non-linear 
devices  called  " neurons  “  to  meet  specifications  of  a  problem 
based  on  representative  examples  of  performance  made  in  the 
middle  eighties  by  D.E.  Rumelhart  and  his  colleagues  [RHW 
86]  (see  however  [BrH  69],  [Par  85],  [Wer  74]),  has  lately 
motivated  the  MVL-community  to  retake  the  study  of 
multiple-valued  neurons  (see  e.g .  [ABJ  93],  [JBA  93],  [TIT 
95],  [Obr  96],  [NRS  98],  [MaC  98])  what  in  the  sixties  and 
seventies  was  simply  known  as  multiple-valued  threshold 
logic  (see  e.g.  [Han  63],  [Mer  64],  [AiA  70],  [Kit  70],  [SaA 
70],  [Mor  72],  [Mor  75],  [Mor  77],  [Mor  79c]).  The  renewed 
interest  in  this  area  has  however  a  different  focus,  that  opens 
new  research  questions  and  possibilities.  It  is  not  -(at  least  not 
yet)_  related  to  a  specific  hardware  realization  and  this  indeed 
has  allowed  facing  questions  that  in  the  former  decades  were 
very  soon  discarded.  On  the  other  hand  the  present  state  of 
affairs  has  not  yet  reached  the  level  of  generality  considered  by 
the  neural  networks  community.  It  seems  to  be  the  right  time 
to  point  out  some  common  aspects  of  interest  that  are  relevant 
to  both  the  MVL  and  the  NN-communities,  particularly  the 
question  of  designing  networks  of  (possibly  generalized) 
multiple-valued  neurons.  The  rest  of  the  paper  is  organized  as 
follows:  in  the  next  section  the  required  basics  will  be 
formalized.  Section  3  is  devoted  to  cover  the  relevant  aspects 
of  the  design  of  feedforward  neural  networks.  Representative 
examples  will  be  discussed  in  section  4.  The  paper  will  be 
closed  with  a  summary  of  conclusions. 


2.  Formal  background 

Definition  1 :  Artificial  Neural  Network 

An  artificial  neural  network  is  a  computational  model 
inspired  in  structures  existing  in  the  brains  of  mammals.  It 
consists  of  memoryless  elementary  processors  called  (by 
analogy)  ” neurons  “  and  links  interconnecting  them.  Links  are 
weighted  and  induce  a  signal  scaling.  Algorithms  are  known  to 
tune  the  weights  in  order  to  minimize  the  square  error  of 
performance. 

Definition  2:  Neuron 

A  neuron  (of  an  artificial  neural  network)  is  an  n-inputs, 
1 -output  elementary  processing  unit  that  represents  the 
composition  of  three  functions: 

•  an  input  function  /:  Cn  ->  C 

•  an  activation  function  A:  C  -»  C 

•  an  output  function  O:  C  C, 

where  C  denotes  a  compact  subset  of  the  reals. 

In  most  neural  networks  /  is  taken  to  be  an  affine 
transformation  and  it  computes  the  weighted  sum  of  the  input 
variables  with  possibly  a  bias.  Usually  A  is  chosen  to  be 
monotone.  In  that  case  it  may  be  hard  symmetric  or  asymmetric 
(sign  or  Heaviside  function,  respectively)  otherwise  soft  sym¬ 
metric  or  asymmetric  (htan  or  sigmoid ,  respectively).  The 
output  function  O  is  either  the  identity,  a  scaling  linear  func¬ 
tion  or  a  limiting  non-linear  one. 

Definition  2.1:  Multiple-valued  neuron 

A  multiple-valued  neuron  is  an  n-input,  1-output  device 
that  maybe  specified  as  the  composition  of  the  three  following 
functions: 

•  /:  (Zp)n  ->  R,  for  a  predefined  integer  p  >  1 

•  A'  R  — ^  Zp 

•  0\  Zp  — >  Zp 
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I  is  an  affine  function  computing  the  weighted  sum  of  the 
inputs,  where  the  weights  are  (in  principle)  real  values;  but 
without  loss  of  generality  may  be  restricted  to  be  integers  [Mor 
77].  A  is  a  staircasewise  thresholding  function  and  O  is  a  one- 
place  function  in  Zp. 

It  becomes  apparent  that  if  O  is  the  identity  function, 
Definition  2.1.  specifies  a  p-valued  threshold  function. 
Multiple-valued  threshold  functions  have  been  very 
thoroughly  studied  for  p=3  (see  e.g.  [AiA  70],  [NaM  74],  [Mor 
77]).  These  functions,  as  in  the  case  of  binary  threshold 
functions  are  characterized  by  their  Chow^-Parameters  [AiA 
70],  or  equivalently,  by  the  first  order  Chrestenson  spectral 
coefficients  of  the  function  [Mor  79a],  [Mor  79b],  Results  for 
p>3  may  be  found  in  [Mor  79c],  [MSO  82]  and  [Mor  89].  It 
has  been  shown  that  if  O  is  allowed  to  be  the  identity  or  a 
mirroring  negation,  there  are  471  2-place  and  85,629  3-place 
ternary  threshold  functions  [AiA  70],  [Mor  77],  and  18,184  2- 
place  quaternary  threshold  functions  [MSO  82],  respectively. 
Of  course,  threshold  functions  constitute  a  functionally 
complete  set,  however  as  in  the  case  of  Rosenblatt’s 
perceptron  [Ros  57],  [Ros  58]  the  fact  that  threshold  functions 
are  defined  in  terms  of  hard  steps  impaired  the  use  of  a 
gradient  descend  algorithm  to  systematically  develop  efficient 
networks  based  on  threshold  gates.  This  was,  among  others, 
one  of  the  main  reasons  to  start  adding  degrees  of  freedom  to 
the  definition  of  threshold  functions  to  increase  their  range  of 
realization  possibilities.  Probably  one  of  the  first  contributions 
in  this  direction  was  the  introduction  of  polynomial 
separability  [Mor  72],  [Mor  75],  [Mor  77],  which  corresponds 
to  the  idea  of  higher  order  neurons  (in  the  world  of  artificial 
neural  networks)  [RoM  91].  In  this  case  /  is  no  longer  affine, 
but  a  quadratic  or  higher  order  function.  However  due  to  the 
discrete  structure  of  ZpD  it  may  be  realized  with  the  help  of 
unary  functions,  since  the  separating  polynomials  may  be 
properly  replaced  by  ’’Manhattan*4— trayectories  reaching  an 
equivalent  separation  [Mor  77],  [Mor  79c].  The  other  early 
contribution  was  the  introduction  of  multilineal  separability 
[NaM  75],  [Mor  77],  [Mor  79c],  [Mor  89],  which  actually 
leads  to  realization  of  multiple-valued  functions  as  a  network 
of  elementary  two-valued  threshold  functions  connected  to  a 
(quantizing  adding  gate  realized  as  a)  p-valued  threshold  gate, 
where  every  threshold  selects  the  corresponding  value  from  Zp. 
(Of  course  threshold  functions  are  trivially  multilineal 
separable).  It  becomes  apparent  that  this  may  be  extended  to 
combine  polynomial  and  multilineal  separability  in  one 
network  to  increase  flexibility.  This  was  done  and  called 
’’Adaptive  Separability*4  in  [Mor  79c],  Related  aspects  have 
recently  been  addressed  [AbA  98]  within  the  scope  of  Frontier 
Algorithms  [McW  92]. 

To  conclude  this  section  another  view  of  classical  p-valued 
threshold  functions  and  of  the  above  presented  extensions  will 
be  given  from  a  more  formal  point  of  view. 

Definition  3:  Subdomains 

VkeZplet  =  {x  e  Zpn  |  f(x)  =  k} 


It  becomes  apparent  that  a  subdomain  may  be  empty  if  the 
function  is  not  suprayective. 

Definition  4:  p-valued  linear  separable  or  threshold  functions 
A  p-valued  function  f:  Zpn  ->  Zp  is  linear  separable  or 
threshold  iff  there  exists  a  set  of  parallel  hyperplanes 
separating  the  subdomains  of  Zpn  in  a  monotonic  way.  (See 
example  in  figure  1 .) 


Figure  1 :  A  linear  separable  or  threshold  4-valued  function 
(o  =0.  x=  l.*  =  2. 0=3) 

Definition  4. 1 :  p-valued  polynomial  separable  functions 

A  p-valued  function  f:  Zp"  Zp  is  polynomial  separable  if 
there  exists  a  set  of  parallel  hyper. surfaces  separating  the 
subdomains  of  Zpn  in  a  monotonic  way.  (See  example  in  figure 
2.) 


Figure  2:  Polynomial  separation  of  a  4— valued  function 
(O  =  0,  x  =  1,#  =  2,  □=  3) 

Definition  4.2:  p-valued  multilineal  separable  functions 
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A  p-valued  function  f:  Zpn  Zp  is  multilineal  separable  if 
there  exists  a  set  of  non-necessarily  parallel  hyperplanes 
separating  the  subdomains  of  Zpn.  Notice  that  in  this  case  more 
than  one  hyperplane  may  be  used  to  separate  two  neighbour 
subdomains.  (See  example  in  figure  3.) 


Figure  3:  Multilineal  separation  of  a  4-valued  function 
(O  =  0,  x  =  1,®  =  2,  □=  3) 

The  combination  of  the  above  extension  leads  to  Adaptive 
Separable  functions.  Figure  4  illustrates  the  adaptive 
separation  of  the  same  4— valued  function  of  figure  2,  where 
two  polynomials  are  replaced  by  literals  (which  are  1-place 
threshold  functions)  and  the  third  by  a  line.  The  following 
realization  is  obtained: 


ifV, 

—  x2  <  0.5 

then 

fii(x)  = 

1  else  0 

ifV, 

— x2  <1.5 

then 

fl2(x)  = 

1  else  0 

if  xj+  x2  >  5.5  then 

f2(x)  = 

1  else  0 

where  Sc*  =  3  iff  i  <  x  <  j  else  0.  This  leads  to: 

fW  =  fii(x)  +  fi2(x)  +  f2(x)  (1) 

Notice  that  if  the  complement  of  x,  written  xA  is  defined  to  be 
3-x,  then  the  former  equations  for  fn(x)  and  fi2(x)  may  be 
given  another  expression,  where  the  auxiliary  unary  function 
will  be  monotone  increasing. 


Since  °x\  =  3  -  (°x\)A  =  3  -  2x3i  the  inequality 
V,  -x2<0.5 


turns  into 

3-Vi  -x2<0.5 

leading  to 

ifVi  +x2>2.5 

then 

fll(x)  = 

1  else  0 

and  similarly 

if  3x3j  +x2  >1.5 

then 

fl2(x)  = 

1  else  0 

3.  Design  of  neural  networks 

In  the  general  case  neural  networks  are  designed  to  solve 
continuous  non-linear  separable  problems  specified  as  fc:  Cn 
-»  C.  Neural  networks  have  the  structure  of  a  graph,  where 
each  node  is  assigned  the  functional  lity  of  a  neuron.  The  most 


Figure  4:  Adaptive  separation  of  the  function  of  figure  2 
(0=  0,  x  =  1,  •=  2,  □=  3) 

graph  structure,  is  known  as  ’’feedforward64  network  and  is 
adjusted  -(’’trained66)-  by  means  of  a  finite  set  of  repre¬ 
sentative  input-output  examples  that  drive  a  gradient  descend 
algorithm  searching  for  a  minimum  of  the  square  error 
(produced  by  the  unadjusted  network).  There  are  several 
crucial  problems  that  accompany  this  process.  The  number  of 
required  i/o-examples  to  properly  adjust  the  network  is 
proportional  to  a  quantity  known  as  the  Vapnik-Chervonenkis 
Dimension  [Blu  89]  and  this  depends  on  structural  properties 
of  the  network  and  of  the  activation  function  of  the  neurons 
[KoS  97].  A  network  is  said  to  be  properly  adjusted  if  not  only 
is  able  to  reproduce  the  training  examples,  but  give  reasonable 
output  to  unknown  inputs  drawn  from  the  same  set  of  cases 
from  where  the  examples  were  taken.  This  is  known  as  PAC 
Learning  [Val  84]  in  the  algorithmic  learning  theory.  Finally, 
in  order  to  apply  the  gradient  descend  algorithm,  it  is  needed 
that  all  activation  functions  are  overall  differentiable.  (This 
immediately  rules  out  all  hard  limiters  like  the  staircase 
function,  which  is  typical  of  multiple-valued  neurons.) 

Neural  networks  designed  to  solve  the  especial  class  of 
discrete  non-linear  problems  specified  as  fa:  Zn  — >  Z  are  less 
critical  in  the  sense  that  they  do  not  have  to  generalize  in  the 
closest  dense  sorrounding  of  the  training  examples,  but  the 
other  problems  mentioned  above  basically  remain. 

Neural  networks  for  multiple-valued  functions  are 
specified  as  fm:  Zpn  -»  Zp  and  the  main  difference  with  the 
former  case  (beyond  the  finitness  of  domain  and  range)  is  that 
usually  all  pn  evaluations  of  fm  are  known  and  available  as 
training  examples  (although  not  all  of  them  may  be  necessary 
as  training  examples  (see  e.g.  [AbA  98]).)  Thus,  the 
generalization  problem  disappears.  The  problem  that  the 
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activation  functions  cannot  be  hard  limiters  is  more  a  ’’model” 
problem  than  a  ’’realization"  problem.  (Recall  the  strong 
efforts  that  are  needed  to  ’’correct"  the  step  response  of  an 
amplifier  from  being  sigmoid  like  to  become  close  to  a 
’’corner".)  Since  without  loss  of  generality  all  weights  may  be 
chosen  to  be  integers  [Mor  77],  it  is  important  to  construct  soft 
staircases  satisfying  two  conditions: 

i)  It  should  be  possible  to  have  steps  to  detect  consecutive 
integer  values  (of  an  input  weighted  summation),  and 

ii)  The  transitions  of  a  step  from  low  to  high  (or  vice  versa) 
measured  as  the  inverse  value  of  the  tangent  at  the  inflection 
point,  should  be  fast  enough  not  to  interfer  with  threshold 
detection.  This  may  be  accomplished  by  taking  a  transition 
margin  no  larger  than  0.333. 

4.  Analysis  of  examples 

Unless  an  evolutionary  design  method  is  used  [Kit  91], 
[Yao  93],  [HeM  96],  [Hei  97],  [FrM  97],  in  the  case  of 
feedforward  neural  networks  a  tentative  network  must  be  a 
priori  selected  and  then  adjusted.  A  differentiable  soft 
staircase  activation  function  may  be  constructed  based  on  the 
following  sigmoid: 

S(y)  =  [1  +  e'20y]-1  from  where  S(0)  =  0.5 

It  is  simple  to  show  that  S’(y)  =  20S(y)(l-S(y)). 

It  follows  that: 

S’(0)  =  20  0.5  (1-0.5)  =  5 
1/S’(0)  =  0.2  <  0.333 

Moreover  S(l)  =  [1  +  e20]'1  =  [1  +  2.06  109]1 

S(l)  =  0.99999999794  (i.e.  *  1) 

A  soft  staircase  activation  function  for  multiple-valued 
neural  networks  may  be  constructed  as: 

A(y)  =  ^S(y-bi)  (2) 

i=I 

where  bj  is  a  bias  to  properly  displace  the  elementary  sigmoids. 
This  is  illustrated  in  figure  5  for  p=4,  bi  =  0.5,  hi  =  1.5  and  b3 
=  2.5 

The  design  of  a  quaternary  full  adder  has  been  chosen  as 
a  first  test  problem.  The  architectures  shown  in  figure  6  will 
be  used  as  basic  neural  netrworks  and  in  both  cases,  weights 
and  bias  values  must  be  learned  to  satisfy  the  truth  table  of 
the  full  adder.  It  is  known  [Mor  89]  that  the  network  in  figure 
6a  is  minimal,  meanwhile  the  network  in  6b  has  a  redundant 
node.  All  neurons  have  an  affine  input  function,  a  soft 
staircase  activation  function  and  a  linear  output  function.  The 
training  was  done  with  the  Stuttgart  Neural  Network 
Simulator  SNNS  [Zel  95]  with  Rprop  [Rie  94]  as  gradient 
descend  algorithm.  The  obtained  weight  and  bias  values  are 
summarized  in  table  1.  (Since  SNNS  is  a  design  and 


simulation  tool  for  classical  neural  networks,  i.e.  the 
activation  functions  are  soft  steps  that  have  only  two 
asymptotic  values,  the  quaternary  neurons  were  simulated  by 
three  two-valued  neurons  followed  by  an  adder,  as  suggested 
by  eq.  (2)  or  simply  by  using  the  property  of  multilineal 
separability  (see  eq.  1).  Moreover  the  inherent  symmetry  of 
the  problem  was  taken  in  consideration  to  speed-up  the 
training.). 


Figure  5:  Soft  staircase  for  4— valued  neurons 


xi  X2  Cj  (a) 

I  hm-r^ 

i  1  =$>► 


X,  x2  ct  (b) 


Figure  6:  Full  adder  quaternary  networks 
(See  Table  1  for  weights  and  bias  values) 


Table  1 :  Summary  of  learned  weights  and  biases 

Circuit 

hidden 

carry  out 

sum 

weights  (bias) 

weights  (bias) 

weights  (bias) 

6a 

— 

1  1  1 

(3.5;  6.5;  6.5) 

111-4 
(0.5;  1.5;  2.5) 

6b 

1 1 1 

(3.5;  6.5;  6.5) 

000  1 

(0.5;  1.5;  2.5) 

111-4 
(0.5;  1.5;  2.5) 
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The  second  test  example  is  the  function  shown  in  figure 
2  with  the  separation  suggested  in  figure  4  and  discussed 
earlier  in  the  text.  The  first  direct  circuit  realization  leads  to 
the  network  shown  in  figure  7a.  It  has  been  shown  [Mor 
77],  that  this  may  be  reduced  to  the  network  of  figure  7b. 


Figure  7.  (a)  Adaptive  separation  of  test  function 

(b)  Reduced  adaptive  realization. 

All  the  weights  of  the  threshold  functions  are  1.  The 
bias  values  for  the  ”hidden“  threshold  gates  match  the 
values  given  in  the  equations  leading  to  or  following  eq.  1 , 
respectively.  The  bias  values  of  the  output  gate  of  the 
reduced  network  are:  2.5;  4.5  and  7.5. 

5.  Conclusions 

Studies  in  multiple-valued  threshold  logic  done  in  the 
seventies  find  a  new  focus  of  attention  from  the  point  of  view 
of  neural  networks.  Making  use  of  formal  developments  in 
this  area,  it  is  possible  to  design  multiple-valued  neural 
networks  of  different  classes  in  an  algorithmic  way  (gradient 
descend).  The  idea  of  considering  an  output  function  for 
neurons,  which  may  produce  any  permutation  -(and  not  only 
the  negation)-  of  values,  suggested  by  the  MV-community 
[AbA  98],  [NRS  98],  may  be  of  interest  for  possible 
applications  in  discrete  neural  networks  with  finite  range. 
(Classification  problems  are  possibly  the  simplest  but  most 
frequent  problems  where  this  variation  might  find  a 
reasonable  use.)  It  should  be  recalled  that  designing 
multiple-valued  circuits  as  feedforward  (discrete)  neural 


networks  by  means  of  a  gradient  descend  algorithm  requires 
an  a  priori  decision  on  the  configuration  of  the  circuit  (which 
may  a  posteriori  be  pruned  if  redundant  gates  are  detected). 
If  a  really  ffee  design  of  a  multiple-valued  circuit  as  a 
feedforward  (discrete)  neural  network  is  looked  for,  then  an 
evolutionary  strategy  should  be  used  [WaM  96],  [HeM  96], 
[MoW  9 8]. In  fact,  by  using  the  evolutionary  design  method 
developed  in  [Hei  97]  the  minimal  solution  for  the  quaternary 
full  adder  was  obtained  without  giving  a  priori  the  structure 
of  the  network. 
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Abstract 

The  Supplementary  Symmetrical  Logic  Circuit  Structure 
(SUS-LOC)  is  a  fully  active,  self  sustaining  architecture 
intended  primarily  for  the  design  and  fabrication  of  logic 
synthesizing  circuits  with  a  radix  greater  than  two .  Any  ‘re¬ 
valued  logic  function  of  ‘n  * -places  (where:  *r '  is  the  radix 
and  an  integer  greater  than  1,  and  *n  *  is  an  integer  greater 
than  0)  can  be  implemented  with  the  SUS-LOC  structure. 

1.0:  Introduction 

The  features  “self  sustaining”  and  “fully  active”  are 
essential  to  the  structure  used  to  implement  Multiple  Valued 
Logic  (MVL)  circuits.  For  the  purpose  of  this  paper  these 
terms  are  defined  as  follows: 

A  self  sustaining  architecture  is  one  capable  of 
implementing  at  least  one  functionally  complete  set  of 
connectives. 

A  fully  active  architecture  is  one  that  ideally  achieves  a 
quiescent  power  consumption  of  zero;  realistically,  the  lowest 
quiescent  power  consumption  is  that  of  leakage  currents  only. 

To  date,  the  only  self  sustaining,  and  fully  active  circuit 
structure  has  been  the  Complementary  Symmetrical  Metallic 
Oxide  Semiconductor  circuit  structure  (CMOS)  when  used  to 
implement  binary  logic  functions.  However,  CMOS  is  not 
capable  of  implementing  MVL  functions  and  remaining  fully 
active.  The  only  self  sustaining  circuit  structure  intended  for 
the  implementation  of  MVL  functions  has  been  Current  Mode 
CMOS  Logic  (CMCL).  However,  CMCL  is  not  fully  active. 
See  references  [3],  [4],  and  [5]. 

MVL  circuits  must  be  high  speed  with  a  low  power 
requirement,  have  well  defined  logic  levels  and  be  easily 
manufactured  to  be  competitive  with  binary  logic.  To  achieve 
MVL  circuits  with  such  attributes,  MVL  circuits  must  be 
based  upon  a  fully  active,  self  sustaining  architecture. 


2.0:  The  SUS-LOC  Structure 

The  Supplementary  Symmetrical  Logic  Circuit  Structure 
(SUS-LOC)  uses  a  technique  called  “supplementation”  to 
detect  and  generate  stable,  well  defined  intermediate  logic 
levels.  The  essence  of  supplementation  is  the  use  of  two 
switches,  connected  to  conduct  in  series,  for  each  intermediate 
logic  level  between  two  terminus  logic  levels. 

The  simplest  form  of  a  SUS-LOC  circuit  with  one 
intermediate  logic  level  consists  of:  1)  one  switch  for  each  of 
two  terminus  logic  levels;  and  2)  two  switches  connected  to 
conduct  in  series,  and  sharing  the  control  signal  with  the 
termini,  for  die  intermediate  logic  level  thereby  supplementing 
the  termini  logic  levels. 

The  advantages  of  the  SUS-LOC  structure  include: 

1 .  a  quiescent  power  requirement  consisting  of  leakage 
currents  only; 

2 .  improved  fan-out  characteristics; 

3 .  well  defined  logic  level  domains; 

4.  switching  times  in  the  low  nano-seconds; 

5 .  multiple  implementations  of  most  functions; 

6.  redundant,  functionally  complete  sets  of  logic;  and 

7.  all  one  place  functions  (OPF)  of  radix  V  are 
implemented  without  a  multiple  place  function  (MPF). 

2.1:  Branch  Definitions 

Any  element  that  connects  or  disconnects  an  output  terminal 
to  or  from  a  source  of  power,  in  response  to  an  input  stimulus, 
is  a  branch.  The  SUS-LOC  structure  contains  two  primary 
branches  named  Terminus  and  Intermediate,  and  one 
secondary  branch  named  Composite.  Generally,  all  functions 
require  two  terminus  branches  per  input  term,  (the  exceptions 
are  not  within  the  scope  of  this  paper). 

A  terminus  branch  consists  of  one  switch  per  input  term, 
which  connects  an  output  terminal  to  a  source  of  power 
representing  a  logic  level  and  is  responsive  to  either  one  or  a 
group  of  contiguous  input  logic  levels. 
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2,5:  Power  Supply  Voltages  and  Designators 


The  presence  and  numeration  of  intermediate  branches  is 
dependant  upon  the  radix  and  specific  logic  function  being 
synthesized.  An  intermediate  branch  consists  of  two  switches, 
per  input  term,  connected  in  series,  which  connect  an  output 
terminal  to  a  logic  level  voltage  between  those  conducted  by 
the  terminus  branches.  An  intermediate  branch  is  responsive 
to  either  one  or  a  group  of  contiguous  input  logic  levels. 

Composite  branches  are  a  combination  of  primary  branches 
connected  in  series,  parallel,  or  series-parallel  as  required  by 
the  multiple  place  function  in  which  they  occur. 

2,2:  Switches 

The  switches  selected  for  this  disclosure  of  die  SUS-LOC 
structure,  due  to  their  low  cost,  high  reliability,  and  ease  of 
manufacture  are  insulated  Gate  Field  Effect  Transistors, 
(FET(s)).  However,  any  switch  with  similar  properties  is  a 
suitable  substitute. 

The  channel  type,  mode,  and  threshold  voltage,  (VGS(TH)), 
of  each  FET  is  fabricated,  or  selected,  as  required  by  the 
circuitry  being  fabricated. 

Because  FETs  have  been  selected,  a  method  of  protecting 
the  inputs  against  the  damaging  effects  of  input  over/under 
voltage  and  electro-static  discharge  should  be  employed. 

2,3:  Symbols 

Figures  la  through  Id  show  die  symbols  used  to  represent 
the  various  FETs  in  schematics.  Near  the  gate  of  each  FET  is 
a  M+V"  or  "-V"  label  to  indicate  the  polarity,  and  magnitude 
of  die  FET’s 

2*4:  Mode  and  Channel  Type  Determination 

The  relationship  of  the  input  logic  levels  (I)  responded  to 
verses  the  output  logic  level  (O)  of  a  branch  determines  the 
channel  type  ‘P’  or  ‘N’  and  mode.  Enhancement  or  Depletion 
(E  or  D),  of  the  FET(s)  used  to  form  the  branch. 

There  are  typically  two  possible  I/O  relationships  for  a 
terminus  branch,  when:  I  <  O  use  ‘P’E;  I  >  O  use  ‘N’E. 

There  are  three  possible  I/O  relationships  for  an 
intermediate  branch,  when:  0>I  use  ‘P’E  and  N’D;  Oe  I  use 
‘P’D  and  ‘N’D;  0<I  use  ‘P’D  and  ‘N’E.  The  simplest  OPF 
containing  all  three  intermediate  branch  combinations  is  the 
radix  5  Base  -1  Complementer,  (inverter),  shown  in  figure  3c. 


While  the  power  supply  voltage  limits  are  determined  by 
the  specifications  of  the  FETs  employed,  the  suggested 
minimum  supply  voltages  for  logic  levels  0  and  1  are  0.0  volts 
and  1.5  volts,  respectively.  And  each  additional  logic  level  be 
the  previous  logic  level  voltage  plus  the  logic  level  1  voltage 
to  provide  a  Logic  Step  Voltage  (LS  V)  of  1 .5  volts. 

In  schematics  the  power  supply  designators  are  simply  the 
letter  “V”  (voltage)  subscripted  with  the  logic  level 
represented  by  that  voltage,  (e.g.  V3  represents  logic  level  3). 

2.6:  Lo{pc  Level  Determination 

Due  to  die  high  input  impedance  of  FETs  and  because  each 
supply  voltage  represents  only  one  logic  level,  the  logic  levels 
of  SUS-LOC  based  circuits  are,  in  virtue,  equal  to  the  supply 
voltages  representing  the  logic  levels.  Section  2.7  establishes 
a  Domain  for  each  of  the  V  different  logic  levels. 

2.7:  Threshold  Voltage  Determination 

The  of  P-channel  FETs  is  set  at  a  percentage  of  the 

LSV  above  the  highest  input  logic  level  to  which  they  are  to 
conduct.  The  VGS(TH)  of  N-channel  FETs  is  set  at  a  percentage 
of  the  LSV  below  the  lowest  input  logic  level  to  which  they 
are  to  conduct. 

This  percentage  of  the  LSV,  at  which  the  VGS(1H)  of  the 
FETs  is  set,  is  called  the  Overlap  Percentage  (OP)  and  has  a 
suggested  range  of  55%  to  75%  of  the  LSV,  such  that  an 
overlap  of  ’ON'  branches  is  obtained  when  the  circuit  is 
switching  from  one  output  logic  level  to  another.  It  is 
suggested  that  the  OP  be  the  same  for  all  switches  in  digital 
applications,  analog  applications  may  require  that  die  VGS(TH), 
OP,  and/or  the  LSV  be  variable. 

When  used  as  suggested  the  OP  maintains  symmetry, 
enhances  die  transfer  characteristics,  and  creates  a  Domain  for 
each  logic  level. 

To  calculate  the  VGS(TH)  for  a  particular  FET,  use  the 
following  equations  selected  for  the  FET’s  channel  type, 
where:  Vi  is  die  input  logic  level  voltage  limit  responded  to, 
and  Vo  is  the  required  output  logic  level  voltage. 

P-channel: 

VGs(th)  ~Vi  -  (Vo  -  (OP  x  LSV)) 

N-channel: 

Vgsoh)  =  Vi  -  (Vo  +  (OP  x  LSV)) 
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3*0:  Example  Functions 

Because  those  skilled  with  binary  logic  are  primarily 
concerned  with  die  OPF  called  an  Inverter,  as  well  as  the 
NOR,  NAND,  and  XOR  multiple  place  fimctions,  the 
analogous  ternary  functions  are  shown  and  briefly  discussed. 

The  following  parameters,  (selected  for  simplicity),  were 
used  to  develop  the  ternary  circuits  shown  and  discussed: 

V2  =5.0  volts  LSV  =  2.5  volts 

Vl  =2.5  volts  OP  =  70% 

V0  =0.0  volts 

3.1:  Ternary  Base  -1  Complementer  (Inverter) 

The  ternary  Base  -1  Complementer,  or  f(x)  =  <210>,  is 
analogous  to  the  binaiy  inverter.  The  ternary  Base  -1 
Complementer  schematic  and  its  symbol  are  shown  in  figures 
2a  and  2b  respectively. 

SPICE  simulation  of  the  chain  of  ten  ternary  Base  -1 
Complementers  shown  in  figure  2c,  using  1  micron  transistor 
models,  produced  the  output  waveforms  shown  in  figure  2d. 

Shown  in  table  A  is  die  average  power  information  from  the 
simulations  of  the  ternary  chain  of  ten  and  a  binary  chain  of 
ten  invertors.  The  average  power  of  die  ternary  chain  is  lower 
than  the  binaiy  chain  when  the  input  is  the  ternary  sequence 
of  0-1-2-1-0.  The  average  power  of  the  ternary  chain  is  higher 
when  die  input  sequence  is  0-2-0,  however,  it  is  not  twice  the 
binary  chain’s  average  power  although  the  ternary  chain 
traverses  20  logic  levels  as  opposed  to  the  binaiy  chain’s  10. 


|  TABLE  A  j 

Input 

Binaiy 

Ternary 

0-2-0 

29.1MW 

50.5//W 

0-1-2-1-0 

N/A 

16.69//W 

3.2:  Functions:  f(x)  =  <200>  and  <220> 

Because  two  ternary  fimctions  f(x)=<200>  and  f(x)=<220> 
are  used  symbolically  in  section  4.4  below,  their  schematics 
are  shown  in  figures  3a  and  3b,  respectively. 

4.0:  Examples  of  Multiple  Place  Functions 

An  MPF  is  essentially  the  combination  of  two  or  more 
OPFs  of  the  same  radix,  (MPFs  with  multiple  radices  is  not 
within  the  scope  of  this  paper).  Each  input  of  an  MPF  is  the 
input  to  one  or  more  of  the  OPFs  used  to  form  the  MPF. 


4.1:  Complementing  Generalized  OR 

The  Complementing  Generalized  OR  gate  (CGOR3), 
shown  in  figure  4,  is  analogous  to  a  binary  NOR  gate.  The 
output  of  a  CGOR3  gate  is  the  Base  -1  Complement  of  the 
highest  input  logic  level  presented  to  its  inputs. 

4.2:  Complementing  Generalized  AND 

The  Complementing  Generalized  AND  gate  (CGAND3), 
shown  in  figure  5,  is  analogous  to  a  binaiy  NAND  gate,  and 
its  output  logic  level  is  the  Base  - 1  Complement  of  the  lowest 
input  logic  level  presented  to  its  inputs. 

4.3:  Combined  One  Place  Functions 

To  show  that  an  MPF  is  the  combination  of  two  or  more 
one  place  fimctions  of  the  same  radix,  as  discussed  in  section 
4.0  above,  figures  4  and  5  are  referenced.  Both  gates  are 
comprised  of  one  radix  3  Base  -1  Complementer  per  input 
term.  One  Complementer  is  comprised  of  Q2,  Q4,  Q6  and 
Q7,  while  the  other  is  comprised  of  Ql,  Q3,  Q5  and  Q8. 

The  distinction  between  a  CGOR,.  and  a  CGANp  is 
defined  by:  1)  which  composite  branch,  comprised  of 
terminus  branches,  is  connected  in  series,  and  which  is 
connected  in  parallel;  and  2)  which  channel  type  forms  the 
series  and  parallel  portions  of  the  composite  branch  formed  by 
the  intermediate  branches. 

4.4:  Complementing  Exclusive  Generalized  OR 

The  Complementing  Exclusive  Generalized  OR  (CXGOR3) 
is  analogous  to  a  binary  XOR  gate.  And,  two  different 
implementations  are  presented  in  figures  6  and  7  to  show  the 
result  of  selecting  different  OPFs  to  form  a  gate. 

Each  implementation  of  the  CXGOR3  has  a  different  speed, 
power  requirement,  and  production  consideration  based 
completely  on  the  OPFs  selected  to  form  the  function. 

5.0:  Multiple  Implementation  of  Functions 

Because  multiple  implementations  of  a  given  function  are 
possible  using  the  SUS-LOC  structure  each  different 
implementation  of  a  function  can  have  a  different  set  of  speed, 
power  requirement,  component  count,  and  production 
consideration  trade-offs.  This  is  illustrated  by  the  different 
implementations  of  the  CXGOR3  in  figures  6  and  7 . 

To  manually  resolve  the  power,  speed,  component  count, 
and  production  consideration  trade-offs  would  require  a  time 
consuming  examination  of  the  implementations  available. 
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An  alternative  approach  is  to  automate  the  design  process 
thereby  allowing  the  trade-offs  to  be  resolved 
programmatically  as  outlined  in  reference  [2],  or  a  modified 
version  thereof. 

The  modifications  would  be  such  that  the  optimization  of  a 
connective,  or  block  of  logic  would  include  both  the  logical 
function  as  well  as  the  hardware  characteristics  of  the  various 
connective  implementations  available,  and  allow  manually 
entered  criteria.  The  use  of  programmatic  methods  would  be 
an  efficient  approach  to  finding  the  best  OPFs  to  form  MPFs, 
and/or  OPFs  and  MPFs  to  form  a  given  block  of  logic. 

Conclusion 

This  paper  has  presented  the  fundamentals  of  die  SUS-LOC 
structure,  a  fully  active  and  self  sustaining  architecture  for  the 
design  and  manufacture  of  MVL  circuits. 

The  SUS-LOC  structure  provides  circuit  designers  with  die 
ability  to  realize  redundant  functionally  complete  sets  of  logic 
of  any  radix.  Logic  circuits  based  on  die  SUS-LOC  structure 
feature  high  speed,  well  defined  logic  levels,  and  low  power 
requirements,  especially  the  quiescent  power.  These  features, 
combined  with  the  necessary  fabrication  technology  being 
currently  available  and  well  known,  makes  the  SUS-LOC 
structure  well  suited  for  MVL  or  Post  Binary  logic  circuits. 
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Abstract 

We  present  efficient  algorithms  for 
the  bi-decomposition  of  arbitrary  incompletely  specified 
functions  in  variable- valued  logic .  Several  special  cases 
are  discussed.  The  algorithms  are  especially  applicable 
for  Data  Mining  applications,  because,  in  contrast  to 
the  general  multi-valued  approaches  to  function  decom¬ 
position  that  decompose  to  arbitrary  tables,  we  create  a 
network  from  multi-valued  two-input  operators  that  are 
selected  by  the  user.  Such  decompositions  lead  to  de¬ 
cision  rules  that  are  easier  to  understand  by  humans. 


1  Introduction 

Simple  Functional  Decompositions. 

Simple  disjoint  decompositions  F  =  H(G(B),  A)  have 
been  used  for  FPGA  synthesis  [13],  synthesis  for  layout- 
driven  logic  synthesis,  Machine  Learning  and  Data  Min¬ 
ing  [9,  10,  11,  6,  18,  19].  Larger  block  F  of  logic  is  split 
to  smaller  blocks  G  and  H ,  and  next  blocks  G  and/or 
H  are  recursively  split  to  smaller  and  smaller  blocks, 
until  they  become  non-decomposable,  or  until  they  can 
be  directly  realized  with  some  other  means  (such  as 
in  a  single  PL  A  block  of  a  CPLD).  Set  B  of  variables 
is  called  the  bound  set ,  and  set  A  is  called  the  set  of 
free  variables.  In  binary  decomposition  the  output  of 
function  G  is  encoded  as  a  binary  vector  of  functions 
Gi(B).  In  multi-valued  decomposition  the  output  of 
function  G  is  a  multi-valued  variable.  In  disjoint  de¬ 
composition  sets  A  and  B  have  no  common  elements. 
Efficient  methods  to  represent  functions  in  decomposi¬ 
tion  have  been  recently  created:  BDD-Encoded  Labeled 
Rough  Partitions  [6],  and  BDD-Encoded  Multi-Valued 
Decision  Diagrams  [5],  but,  for  the  simplification  of  pre¬ 
sentation,  in  this  paper  we  will  use  Kmaps  to  illustrate 
our  concepts.  The  decomposition  principle  is  always  to 
find  the  solution  with  the  smallest  possible  complexity. 
The  complexity  can  correspond  to  the  total  size  of  non- 
decomposable  blocks,  the  number  of  blocks,  or  some 
other  evaluation  of  the  result.  This  is  obvious  for  FPGA 
or  layout  circuit  applications,  but  it  holds  also  for  Ma¬ 


chine  Learning  and  Data  Mining,  where  a  simpler  ex¬ 
pression,  satisfying  the  Occam  Razor  Principle,  creates 
a  more  meaningful  solution,  and  one  that  minimizes 
the  learning  error  [4,  14].  Various  decompositional  ap¬ 
proaches,  mainly  based  on  Ashenhurst  and  Curtis  de¬ 
compositions,  have  been  realized  in  several  programs 
that  can  handle  both  binary  and  multi-valued,  com¬ 
pletely  and  incompletely  specified  functions  and  re¬ 
lations,  [11].  The  methods  have  been  also  extended 
to  non-disjoint  decompositions  [9,  10].  Simple  non- 
disjoint  decomposition  is:  F  =  H(G(B\  UC),  A\  UC), 
where  C  is  the  set  of  shared  variables.  Thus,  sets  A 
=  A\U  C  and  B  =  B\U  C  are  non-disjoint. 
Decomposition  in  Multi-Valued  Circuit  Design. 
Turning  our  attention  to  multi-valued  circuit  design, 
let  us  first  observe  that  there  exist  currently  very  few 
methods  for  synthesis  of  multi-level  multi-valued  cir¬ 
cuits.  They  include  factorization  methods ,  decision  di¬ 
agram  based-methods ,  and  decomposition  [3,  11].  Re¬ 
cent  decomposers  can  handle  data  with  don’t  cares  (and 
multi-valued  relations)  better  than  other  methods.  Two 
approaches  to  multi-level  multi-valued  decomposition 
are  possible.  The  first  approach,  Fixed- Multiplicity 
Decomposition ,  [3],  assumes  that  all  input  and  output 
variables,  as  well  as  all  intermediate  variables  G,  that 
are  created  in  the  decomposition,  have  not  more  than 
a  fixed  constant  number  of  values.  For  instance,  they 
have  at  most  3  values  for  decomposition  of  ternary  func¬ 
tions.  This  number  is  called  the  multiplicity  index.  The 
advantage  of  such  an  approach  is  that  it  allows  for  easier 
conversion  of  the  tables  describing  non-decomposable 
blocks  to  Multi-Valued  Sum  of  Products  (SOP)  circuits 
with  some  existing  MV  gates  from  cell  libraries,  or  with 
PLA-like  function  generators.  The  second  approach, 
Unrestricted- Multiplicity  Decomposition,  [11],  does  not 
assume  any  constraints  on  the  number  of  values,  and 
is  thus  better  suited  for  Data  Mining,  where  the  data 
are  naturally  of  this  type  because  the  attributes  (in¬ 
put  variables)  have  various  sets  of  values.  For  instance, 
variable  SEX  can  have  as  few  as  two  values,  and  vari¬ 
able  AGE  can  have  as  many  as  100  values.  After  the 
decomposition  of  this  type  to  tables  with  variables  of 
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different  radii,  the  symbolic  variables  are  next  encoded 
with  k- valued  signals.  The  advantage  of  this  approach 
is  sometimes  finding  a  decomposition  of  a  smaller  cost, 
because  of  a  more  general  decomposition  model  used. 
Also,  this  approach  allows  to  find  decompositions  that 
are  more  general  than  the  Curtis  decomposition,  but 
are  based  on  very  similar  principles  [3,  9,  10].  We  allow 
there  decompositions  with  arbitrary  values  of  multi¬ 
plicity  indices,  which  is  in  contrast  to  Curtis  decom¬ 
position  that  in  binary  case  requires  the  number  \i0  of 
output  signals  to  be  smaller  than  the  number  \ii  of  in¬ 
put  signals  to  the  block.  In  case  of  MV  decomposition 
this  constraint  translates  to  Ashenhurst  decompo¬ 
sition  with  single  r-valued  signal  for  the  block  or  to 
a  Curtis-like  MV  decomposition  with  r-valued  sig¬ 
nals  from  the  block  and  less  output  signals  than  input 
signals  to  the  block.  Our  decompositions  (both  binary 
and  multi-valued)  assume  none  of  these  constraints. 
The  disadvantage  of  both  Fixed-Multiplicity  and  Free- 
Multiplicity  decompositional  approaches  with  respect 
to  the  MV  factorization  and  MV  decision  diagram  ap¬ 
proaches  is  that  arbitrary  MV  blocks  are  created  as  a 
result  of  the  decomposition.  They  are  specified  by  ta¬ 
bles  with  k- valued  input  and  k-valued  output  variables. 
Such  tables  require  that  in  order  to  realize  the  circuit 
corresponding  to  this  decomposition,  the  blocks  spec¬ 
ified  by  the  tables  should  be  next  either  further  opti¬ 
mized  using  some  other  MV  synthesis  tools,  or  realized 
as  the  not  necessarily  optimized  mv-SOP  circuits  (us¬ 
ing  MIN,  MAX  and  literals),  that  directly  correspond 
to  the  non-decomposable  blocks.  From  the  point  of 
view  of  circuit  realizability  of  blocks,  it  would  be  better 
to  decompose  the  function  to  only  few  selected  types  of 
blocks  (MV  gates),  described  by  multi-valued  operators 
that  are  directly  realizable  in  hardware.  For  instance, 
it  can  be  shown  that  every  function  is  decomposable  to 
MV  inverters  or  window  functions ,  and  two-input  MIN 
(minimum)  blocks.  Such  realizations,  however,  may  be 
very  far  from  the  minimum.  (The  ternary  MV  inverters 
add  modulo-3  constant  1  or  2  to  the  value.  The  win¬ 
dow  literal  is  defined  as  follows:  X1  =  2  for  X=i  and 
0  otherwise.  These  types  of  single-input  functions  can 
be  easily  extended  to  any  number  of  values  and  realized 
in  hardware). 

Bi-Decompositions. 

Our  goal  here  is  to  present  decompositional  approaches 
that  decompose  to  finite  number  of  selected 
block  types,  and  we  will  achieve  this  by  the  gener¬ 
alization  of  the  bi-decomposition  methods  introduced 
originally  by  Davio  and  next  discussed  by  many  au¬ 
thors:  Bochmann  [20,  21],  Le  [22],  Steinbach  and  co¬ 
authors  [26]-citesteinbach-end,  Zakrevskij  [36,  37,  38], 
and  Sasao/Butler  [12].  The  bi-decomposition  approach 
for  Boolean  logic  is  based  on  AND,  OR  and  EXOR 
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Figure  1.  Netlist  with  disjoint  and  non-disjoint 
bi-decompositions 


decompositions.  It  decomposes  hierarchically  an  arbi¬ 
trary  function  to  three  types  of  gates:  AND,  OR  and 
EXOR.  Each  type  of  one-level  decomposition,  AND- 
decomposition,  OR-decomposition  or  EXOR- 
decomposition  can  be  disjoint  or  non-disjoint.  Ev¬ 
ery  function  was  proved  to  be  decomposable,  when 
non-disjoint  decompositions  are  also  allowed  [26].  Ob¬ 
serve,  that  generalizing  the  bi-decomposition  approach 
to  multi-valued  logic  would  allow  it  to  become  a  prac¬ 
tical  tool  for  multi-level  multi-valued  design.  As¬ 
suming  non-disjoint  decompositions  and  availability  of 
negation  or  window  functions  plus  multi-valued  con¬ 
stants,  only  MIN  decompositions  would  be  sufficient 
(this  can  be  proved  by  generalizing  the  proof  that 
NAND  and  constants  is  a  complete  system  for  binary 
logic). 

In  another  variant,  one  would  use  additionally  the 
MAX  gate,  which  would  lead  to  an  MV  counterpart  of 
AND/OR/NOT  binary  logic.  Creating  such  a  system 
is  possible  and  is  already  included  in  the  approach  pre¬ 
sented  below.  The  advantage  of  such  approach  would 
be  a  requirement  of  having  very  few  standard  MV  cells. 
However,  it  can  be  observed  that  the  desirable  possi¬ 
bility  of  finding  a  decomposition  with  no  shared  vari¬ 
ables  or  with  a  small  set  of  shared  variables  greatly  in¬ 
creases  with  a  larger  set  of  operators.  Larger  set  is  also 
a  good  assumption  for  Data  Mining  applications,  where 
many  different  kinds  of  relational,  mv-logic,  arithmetic, 
and  language-based  operators  are  used  by  knowledge 
engineers  and  human  experts  that  create  hierarchical 
rule- based  expert  systems  [17].  The  question  thus  re¬ 
mains,  how  many  gate  types  would  be  needed  to  ob¬ 
tain  satisfactory  solutions,  because  too  large  a  num¬ 
ber  of  them  would  be  not  practical.  Obviously,  there 
should  be  operators  realizing  known  operators  MIN, 
MAX,  MODSUM,  AVERAGE,  MODULO  MULTIPLI¬ 
CATION,  TRUNCATED  SUM  and  other.  Example  of 
a  netlist  being  a  Directed  Acyclic  Graph  of  non-disjoint 
and  disjoint  single-level  bi-decompositions  with  opera¬ 
tors  MIN,  MODSUM,  MAX,  GREATER,  AVERAGE 
LOW  and  DISTANCE  is  shown  in  Figure  1. 

It  was  proven  by  Shannon  [15]  that  when  the  number 
of  input  variables  n  grows,  the  ratio  of  disjointly  decom- 
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Figure  2.  Netlist  from  variable-valued  operators 
corresponding 

to  the  mixed  binary-MV  decision  rule  IF  (A  < 
B)  AND  MIN(C,  D)  =  MAX(U,V)THEN  R  = 
3  ELSE  R  =  AVERAGE(C,D).  Intermediate 
variables  X ,  Y  and  Z  are  binary.  Block  of  MV 
multiplexer  is  internally  built  from  non-disjoint 
decomposition  to  MAX,  two  MIN ,  and  two  level- 
changing  inverters /buffers. 


posable  functions  decreases  to  zero.  From  the  practical 
point  of  view,  however,  this  result  is  not  particularly 
restrictive,  because: 

(1)  Real-life  functions  are  usually  decomposable,  in 
contrast  to  randomly  generated  worst  case  functions 
used  in  mathematical  proofs  as  one  from  [15].  For  in¬ 
stance,  for  binary  functions  it  was  shown  experimen¬ 
tally  on  large  benchmark  sets  and  on  many  types  of  real- 
life  benchmarks  from  different  application  areas  that 
82%  of  them  were  decomposable,  while  only  1%  of  ran¬ 
dom  functions  were  decomposable  [14]. 

(2)  We  consider  non-disjoint  decompositions,  for  which 
it  was  proven  that  every  function  is  decomposable,  and 
practically  with  only  few  repeated  variables  (disjoint 
decompositions  are  still  preferable).  If  disjoint  decom¬ 
positions  are  not  possible,  we  select  small  sets  of  shared 
variables. 

(3)  Functions  with  don’t  cares  are  better  decompos¬ 
able:  the  more  don’t  cares,  the  more  probable  to  find 
a  disjoint  decomposition.  It  is  well-known  that  Data 
Mining  functions  have  extremely  high  percent  of  don’t 
cares.  As  demonstrated  in  [11],  introducing  of  shared 
(the  same  as  repeated)  variables  has  the  same  effect  as 
dealing  with  incomplete  functions;  one  repeated  binary 
variable  causes  creating  a  new  Kmap  with  half  of  don  t 
cares. 

Bi-Decompositions  for  Data  Mining. 

Let  us  now  turn  our  attention  to  a  non-disjont, 
operator-based,  hierarchical  decomposition  as  a  new 
approach  to  Data  Mining.  All  Machine  Learning  and 
Data  Mining  methods  assume  Occam  Razor  principle, 


which  is  practically  realized  by  minimizing  global  cost 
functions  such  as  Decomposed  Function  Cardinality 
(DFC)  [11],  total  multiplicity  index  (Column  Cardinal¬ 
ity),  or  other  measures  [16].  It  was  found  that  there 
is  a  very  high  correlation  between  the  DFC  value  and 
the  learning  error  for  most  of  the  benchmarks  [14,  4]. 
Another  important  factor  taken  into  account  when  eval¬ 
uating  various  decomposition  structures  with  blocks  is 
also  the  f,understandability,f  or  ” explainability”  prop¬ 
erty  of  the  resultant  set  of  rules  (expression,  network, 
circuit)  [17, 16].  Thus,  between  two  sets  of  rules  created 
by  a  decomposer  that  have  the  same  DFC  cost,  the  set 
of  rules  that  is  easier  to  understand  by  a  human  expert 
is  better.  Often,  the  human  can  further  improve  the 
rules,  and  it  was  shown  that  systems  that  combine  com¬ 
puter  and  human  knowledge  acquisition  on  these  tasks 
are  better  than  computers  or  humans  alone  [17,  16]. 
Moreover,  expert  system  users,  such  as  medical  doc¬ 
tors  or  lawyers,  do  not  want  to  use  systems  that  do 
not  provide  explanations  that  would  be  comprehensible 
to  them.  As  one  experienced  data  knowledge  engineer 
working  with  human  experts  observed:  '7  have  never 
met  a  doctor  who  would  use  EITHER-OR  (EXOR)  con¬ 
junctive  in  his  reasoning” .  Thus,  the  medical  doctor 
will  be  never  satisfied  with  the  diagnosis  provided  by  an 
expert  system  based  on  a  totally  ”  black-box”  approach 
(such  as  an  artificial  neural  net),  or  even  using  complex 
rules  with  exotic  to  them  operators  such  as  EXOR  or 
MODSUM.  In  addition,  it  was  observed  that  in  real- 
life  expert  systems  the  functions  realized  in  blocks  are 
in  most  cases  monotonic  [17].  Thus  monotonic  opera¬ 
tors  should  be  preferably  used  in  decomposition,  or  at 
least  given  preference  during  the  decomposition  choices. 
Another  preference  may  be  to  use  symmetric  operators. 
Concluding,  both  in  MV  circuit  design  and  Data  Mining 
applications,  there  is  a  need  for  a  method  to  synthesize 
a  strongly  unspecified  multi-valued  function  with  arbi¬ 
trary  numbers  of  values  in  variables  to  the  preselected 
set  of  simple  two-argument  operators,  leading  to  solu¬ 
tion  rules  such  as  one  illustrated  in  Figure  2.  The  goal 
of  this  paper  is  to  introduce  an  efficient  method  for  find¬ 
ing  bi-decompositions  of  strongly  unspecified  multiple¬ 
valued  functions,  for  selected  sets  of  two-input  opera¬ 
tors. 

2  Patterns  and  Decompositions  of 
Completely  Specified  Functions 

Operators  in  Ternary  Logic. 

For  simplicity,  we  will  illustrate  our  approach  using 
ternary  logic,  but  all  algorithms  can  be  formulated  for 
arbitrary  variable- valued  logic  using  Multivalued  Deci¬ 
sion  Diagrams  [5].  Some  subset  of  well-known  oper¬ 
ators  in  ternary  logic  are  shown  in  Figure  3.  There 
are  several  subsets  of  them  that  are  functionally  com- 
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Figure  3.  Selected  Operators  for  Ternary  Logic 
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Table  1.  Table  for  MIN  decomposition  of 
ternary  function  fx 
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plete,  with  single-input  window  functions,  inversion 
functions,  or  with  arbitrary  universal  window  func¬ 
tions  or  literals  applied  only  in  the  input  level  (as 
in  MV  SOP  realizations).  Selection  of  functionally 
complete  subsets  is  not  a  topic  of  this  paper  because 
for  Data  Mining  larger  sets  of  operators  are  better. 
Operators  from  Figure  3  are  only  examples,  and  the 
method  shown  below  will  lead  to  solutions,  possibly 
non-optimal,  for  any  functionally  complete  set  of 
operators.  It  can  be  easily  proven  from  the  funda¬ 
mental  theorem  of  MV  decomposition  [11]  that  if  de¬ 
compositions  F  =  <f>(G(B),A)  and  F  =  <j>(ByH(A)) 
exist  with  multiplicity  indices  pi  and  p2,  respectively, 
then  decomposition  F  =  <f>{G{B)y  H(A))  exists  where 
gate  <j>  has  its  respective  inputs  with  p\  and  P2  values. 
Thus,  there  exists  a  Fixed-Multiplicity  Decomposition 
with  MAX(p  1,1*2)  values.  For  instance,  if  p\  =  2  and 
P2  =  3  then  a  ternary  decomposition  of  a  ternary 
function  F  exists. 

MIN  Decomposition. 

To  introduce  our  ideas,  we  will  start  with  MIN  De¬ 
compositions  of  a  completely  specified  function.  MIN 
decomposition  has  the  form: 

f(X)  =  MIN(g(A),h(B)). 

For  instance, 

f(A,B,C,D)  =  MIN{g{A,B),  h{C,D)). 

Given  is  function  f\ ( A ,  J3,  C,  D)  from  Table  1.  It  can 
be  observed  in  Table  1  that  there  are  three  row  patterns 
and  three  column  patterns.  The  patterns  of  rows  are: 

000  000  000 
Oil  no  101 
012  120  201 

We  say  that  all  rows  that  have  the  same  pattern  are 
compatible.  The  patterns  of  columns  are: 

000 

on 

012 

012 

000 


Table  2.  First  stage  of  decomposition.  Func¬ 
tions  h(C,  D)  and  g(A,  B)  for  MIN  decomposi¬ 
tion  of  ternary  function  fi.  These  functions 
are  created  by  labeling  with  symbols  0,1  and 
2  identical  rows  and  identical  columns  of  the 
table,  respectively 


Oil 

Oil 

012 

000 

Columns  with  the  same  pattern  are  called  the  compat¬ 
ible  columns .  Table  2  presents  the  first  stage  of  finding 
the  MIN  decomposition.  By  labeling  all  (compatible) 
rows  of  zeros  by  0,  all  rows  of  zeros  and  ones  by  1, 
and  all  rows  of  zeros,  ones  and  2’s  by  2,  we  obtain  the 
additional  column  which  describes  function  h{C)  D). 
By  labeling  all  (compatible)  columns  of  zeros  by  0, 
all  (compatible)  columns  of  zeros  and  ones  by  1,  and 
all  (compatible)  columns  of  zeros,  ones  and  2’s  by  2, 
we  obtain  the  additional  row  which  describes  function 
<7(^4,  B).  From  the  additional  row  we  can  create  directly 
the  Kmap  of  function  g(A,  B ),  Figure  4a.  By  using  the 
permutation  operation  (1  —  2)  of  exchanging  rows  1  and 
2  of  variable  C,  Figure  4b,  we  obtain  the  map  of  func¬ 
tion  h(C,  D).  This  is  additionally  explained  in  Kmaps 
from  Figure  4c-f.  Finally,  the  entire  decomposition  of 
function  /i  can  be  drawn,  Figure  4g.  Observe  that  the 
inverter  (the  1-2  permuter)  operator  in  variable  C  was 
used.  There  exist  three  such  permuter  operators  (1-2, 
0-1,  and  0-2)  for  ternary  logic. 

Concluding,  the  algorithm  to  find  ternary  MIN  de¬ 
composition  can  be  summarized  as  follows. 

[1.]  Find  all  row  patterns. 

If  there  are  more  than  3  patterns,  exit. 
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[2.]  Find  all  column  patterns. 

If  there  are  more  than  3  patterns,  exit.  Table  5.  Table  for  functions  h(C,  D)  and  g(A,  B) 

[3.]  Label  patterns  as  presented  above.  If  this  is  a  table  for  MAX  decomposition  of  ternary  function  fo. 

of  MIN  operator  (as  in  Table  3),  then  MIN  decomposi-  These  functions  were  calculated  analogously 

tion  exists.  a*  'n  Table  2 


This  method  is  a  generalization  of  the  method 
from  [12].  Let  us  now  explain  another  possible  ap¬ 
proach  to  the  same  problem,  this  approach  generalizes 
the  method  from  [26].  Analyze  the  patterns  of  rows 
column-by-column .  We  find  that  after  removing  all  re¬ 
peated  columns,  there  are  only  three  patterns:  000,  012 
and  Oil.  Similarly,  removing  all  repeated  patterns  of 
rows  from  patterns  of  columns  above,  the  remaining 
patterns  are:  000,  012  and  011.  The  same  patterns 
as  before.  Thus  the  decomposition  operator  from  Ta¬ 
ble  3  was  found  which  is  the  operator  of  MIN.  Both  the 


Table  7.  Table  for  MODSUM  decomposition  of 


above  methods  will  become  useful,  when  it  comes  to  ternary  function  /3 


find  patterns  in  cofactors  of  large  bound  and  free  sets 
for  incompletely  specified  functions. 

MAX  Decomposition- 

Given  is  function  /2  from  Table  4,  with  bound  and  free 
sets  of  variables  B  =  {A,B}  and  A  =  {C,D}.  Following 
the  first  procedure  above,  Table  5,  we  find  the  realiza¬ 
tion  from  Figure  5  with  the  same  functions  h  and  g  as 
previously.  Now,  following  the  second  procedure  above, 
but  labeling  the  columns  and  rows  of  0’s,  l’s  and  2’s  as 
0;  the  columns  and  rows  of  l’s  and  2’s  as  1;  and  the 
columns  and  rows  of  2’s  as  2;  we  found  that  the  re¬ 


duced  pattern  table,  Table  6,  is  the  table  of  operator  Figure  5.  MAX  decomposition  for  function  /2 

MAX.  Analogously,  the  same  solution  is  found  using 
the  second  approach  which  we  leave  to  the  Reader. 


MODSUM  Decomposition. 
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1  CD  \  AB  -+ 

00 

01 

02 

10 

11 

12 

20 

21 

22 

00 

0 

0 

0 

0 

0 

0 

0 

0 

0 

01 

1 

1 

1 

1 

1 

1 

1 

1 

1 

02 

2 

2 

2 

2 

2 

2 

2 

2 

2 

10 

2 

2 

2 

2 

2 

2 

2 

2 

2 

11 

0 

0 

0 

0 

0 

0 

0 

0 

0 

12 

1 

1 

1 

1 

1 

1 

1 

1 

1 

20 

1 

1 

1 

1 

1 

1 

1 

1 

1 

21 

2 

2 

2 

2 

2 

2 

2 

2 

2 

22 

0 

0 

0 

0 

0 

0 

0 

0 

0 

Table  9.  Table  of  function  temp  for  MODSUM 
Decomposition  of  function  f3 


Given  is  function  f3  from  Table  7,  with  bound 
set  of  variables  B  =  {A,B}  and  free  set  of  vari¬ 
ables  A  =  {C,D}.  Following  the  same  two  pro¬ 
cedures  as  previously,  we  find  that  decomposition 
f3  =  MODSUM(g(A,  B)}  h(C,  D)),  with  the  same 
functions  h{C,  D)  and  g(A ,  B)  as  in  the  two  previous 
examples,  and  the  reduced  pattern  table,  Table  8  is  the 
table  of  operator  MODSUM. 

In  this  case,  one  more  approach  is  possible: 

(1)  calculate  function  g:  first  row  of  f3:  012120201  g . 

(2)  calculate  a  temporary  function 

temp  =  Mod-diff(f3(A,B,C,D),  g(AyB)). 

(3)  calculate  function  h :  first  column  of  /  (constant 
values  in  each  ,  row  are  neccessary  for  MODSUM  de¬ 
composition) 


CD  h 
00  0 
01  1 
02  2 
10  2 

11  O 

12  1 
20  1 
21  2 
22  O 


Functions  g  and  h  can  be  found  as  previously  and  the 
operator  table  of  the  decomposition  operator  is  MOD- 
SUM,  see  Table  8. 

Composing  two-input  operators  from  other  op¬ 
erators. 

Even  if  a  decomposition  with  an  unknown  operator  is 
found  using  the  method  of  compatible  columns,  this  op¬ 
erator  can  be  still  composed  from  the  known  operators. 

Example:  Function  fa  from  table  in  Figure  6 
cannot  be  decomposed  by  MIN(A,B),  MAX(A,B)  or 
MODSUM(A,B).  But  fa  =  MAX(g,h ),  where 
functions  g  and  h  are  as  in  Figure  6b, c.  Thus, 

fa  =  MAX(MODSU  AT  (Oil,  002),  MODSU  M(002,  Oil)) 
see  Figure  6d,  where  <  Oil  >  and  <  002  >  are  universal 
literals. 

Conclusion  on  these  examples. 

The  above  three  examples  suggest  two  general  proce¬ 
dures  for  a  completely  specified  function.  The  first 
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Figure  6.  Schematic  of  composed  realization  of  an 
operator  using  MAX  decomposition  for  function 

u 


procedure,  that  we  will  call  the  Compatibility  Algo¬ 
rithm ,  is  based  on  labeling  arbitrary  compatible  pat¬ 
terns,  calculating  multiplicity  indices,  and  next  find¬ 
ing  tables  of  functions  F,  g  and  h.  This  procedure  is 
the  same  for  any  two-input  decomposing  operator,  it 
does  not  assume  any  specific  operator,  and  the  oper¬ 
ator  becomes  known  only  when  the  decomposition  is 
completed.  Thus,  the  user  has  no  possibility  to  influ¬ 
ence  the  decomposition  type.  This  may  lead  to  an  op¬ 
erator  that  is  not  one  of  the  selected  operators.  On 
the  other  hand,  this  method  allows  for  permuting  in¬ 
puts,  thus  each  operator  pattern  corresponds  to  one  of 
many  possible  compositions  of  two-input  and  single  in¬ 
put  operators,  so  it  is  efficient.  In  case  of  incompletely 
specified  function,  the  Compatibility  Algorithm  would 
require,  however,  coloring  of  two  large  incompatibility 
graphs',  one  with  nodes  for  the  cofactors  of  the  bound 
set,  and  the  other  one  with  nodes  for  the  cofactors  of 
the  free  set  [10].  The  graphs  are  large,  because  we  split 
the  set  of  input  variables  into  two  sets  of  approximately 
the  same  size.  If  for  both  of  these  graphs  we  are  able  to 
find  coloring  with  3  or  less  colors  (multiplicity  index  3 
or  less),  the  binate  decomposition  exists.  In  contrast  to 
the  2-Colorability,  which  is  polynomial  and  for  which 
we  have  implemented  an  efficient  algorithm  [8],  the  3- 
Color ability  Problem  is  NP-complete.  To  complicate 
things  more,  because  the  minimal  graph  coloring  is  usu¬ 
ally  not  unique,  several  decompositions  can  be  found, 
with  various  decomposition  operators.  Such  coloring- 
based  procedure  would  be  not  very  efficient,  and  would 
not  give  preferences  to  the  desired  decomposition  types 
such  as  monotonic,  or  symmetric.  The  second  ap¬ 
proach,  called  Operator  Pattern  Algorithm ,  assumes 
certain  type(s)  of  decomposing  operator,  and  looks  for 
a  decomposition  only  with  respect  to  this  operator.  It  is 
thus  an  operator- oriented  decomposition.  Therefore  be¬ 
low,  for  incomplete  functions,  we  will  present  a  more  ef¬ 
ficient  and  comprehensive  approach,  that  will  make  use 
of  specific  properties  of  the  bi-decomposition  of  multi- 
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valued  incomplete  functions,  and  will  combine  the  ideas 
from  both  the  algorithms. 


3  Decompositions  of  Incompletely 
Specified  Functions 


Number  of  Decompositions. 

In  case  of  a  k-  valued  logic  and  n  variables,  there  are  kk* 
functions.  If  we  combine  two  of  these  functions,  one  for 
rows  and  one  for  columns,  we  get  kk  *  kk  =  kk  +k 


functions  realizable  by  decomposition.  This  number  is 
much  smaller  than  the  number  of  all  possible  functions 
of  2  *  n  variables:  fc*2”  functions.  Taking  thus  an  arbi¬ 
trary  randomly  selected  function  of  n  variables,  we  have 
a  probability  of  1  -  that  this  function  is  not  de¬ 
composable.  Thus  random  complete  functions  of  many 
variables  are  very  unlikely  to  be  disjoint-decomposable. 
Recall,  however,  that  an  incomplete  function  of  2 n  vari¬ 
ables  specified  on  K  cares  corresponds  to  kk  n~K  com¬ 
plete  functions.  Thus  taking  an  arbitrary  function  with 

K  cares,  we  have  the  probability  of  (1  —  p^nr) 
that  this  function  is  not  decomposable.  When  K/k2n  is 
small,  as  in  Data  Mining  where  it  can  be  less  than  0.01 
percent,  the  probability  of  finding  disjoint  decomposi¬ 
tion  is  then  quite  high,  even  for  random  functions.  Still, 
if  the  disjoint  decomposition  does  not  exist,  addition  of 
every  shared  variable  improves  the  chance  of  finding  a 
decomposition.  The  procedure,  however,  may  become 
inefficient,  so  good  methods  for  finding  bound  and  free 
sets  are  needed,  as  well  as  a  fast  algorithm  for  basic  de¬ 
composition  execution  step  (because  it  is  repeated  very 
many  times) . 


MIN,  MAX  and  Simple  Monotonic  Decomposi¬ 
tions. 

MIN  Decomposition  of  an  incompletely  specified  func¬ 
tion  is  based  on  the  same  principles  as  illustrated  in 
previous  sections  for  complete  functions.  Simplified  al¬ 
gorithm  for  MIN  Decomposition  is  the  following: 

[1]  Find  rows  and  columns  with  pattern  of  only  0’s  and 
don’t  cares,  label  respective  rows  and  columns  by  0. 
Remove  0’s  from  these  rows  and  columns. 

[2]  Find  rows  and  columns  with  compatible  patterns  of 
Ts  and  don’t  cares,  label  respective  rows  and  columns 
by  1.  Remove  l’s  from  these  rows  and  columns.  If  there 
is  more  than  one  compatible  pattern  of  0’s  and  l’s,  exit. 

[3]  Find  rows  with  compatible  patterns  of  2’s  and  don’t 
cares,  label  respective  rows  and  columns  by  2.  Remove 
2’s  from  these  rows  and  columns.  If  some  patterns  re¬ 
main,  exit. 

[4]  The  labeled  rows  and  columns  determine  the  MIN 
operator  pattern. 

To  understand  this  algorithm  the  Reader  is  advised 
to  apply  it  to  our  previous  tables  from  MIN  decomposi¬ 


tion.  Also,  try  to  apply  it  to  the  MIN  table  from  Figure 
3.  Observe  that  similar  algorithm  can  be  created  for 
MAX  decomposition,  but  the  order  of  removals  will  be 
not  [0,1,2]  as  for  MIN  presented  above,  but  [2,1,0].  This 
can  be  checked  on  the  MAX  table  from  Figure  3.  Simi¬ 
larly  it  can  be  checked  in  Figure  3  that  TRUNCATED- 
PRODUCT  operator  (TRUNCATED-PRODUCT  de¬ 
composition)  will  be  found  for  order  [0,2,1],  GREATER 
operator  for  order  [0,1,0],  and  GREATER- EQUAL  op¬ 
erator  for  order  [1,0,1].  All  such  functions,  for  which 
the  decomposition  can  be  checked  by  the  parametrized 
variants  of  the  above  algorithm  with  different  orders  of 
removals,  we  will  call  the  Simple  Monotonic  Functions . 
Functions  for  which  the  removal  can  be  done  partially 
are  also  good  candidates  for  non-disjoint  decomposi¬ 
tions,  and  we  call  them  the  Partially  Monotonic  Func¬ 
tions.  In  tables  from  Figure  3,  GALOIS-PRODUCT 
and  TRUNCATED-SUM  are  such  functions. 
Preprocessing  for  graph  coloring. 

Because  graph  coloring  [8,  9,  11]  can  be  slow,  we  pro¬ 
pose  to  apply  the  following  algorithm  that  in  many 
cases  removes  totally  the  necessity  of  coloring,  and  in 
other  cases  reduces  significantly  the  size  of  the  rows  and 
column  incompatibility  graphs. 

[1]  Combine  all  rows  that  have  the  same  overlapping 
symbols. 

[2]  Combine  all  columns  that  have  the  same  overlapping 
symbols. 

[3]  Iterate  until  no  rows  or  columns  can  be  combined. 

[4]  If  the  size  of  the  resultant  matrix  is  3  x  3,  bi¬ 
decomposition  is  found.  The  decomposition  operator 
is  specified  by  the  matrix.  This  matrix  can  have  don’t 
cares,  thus  specifying  various  decomposition  operators. 
Moreover,  various  functions  g  and  h  can  be  created  from 
the  traces  of  the  combined  (i.e.,  labeled  the  same  way) 
rows  or  columns  in  the  process  of  combining.  If  more 
than  3  rows  or  columns  were  found  that  cannot  be  com¬ 
bined  to  3  patterns,  exit. 

[5]  Execute  two  Graph  Colorings  for  functions  g  and  h. 
For  each,  if  the  chromatic  number  is  larger  than  3,  exit. 
Otherwise  combine  in  each  the  nodes  colored  with  the 
same  color  creating  the  operator  table. 

Example.  Given  is  the  table  of  function  /6(A,  B,  C,  D) 
from  Table  10.  By  combining  rows  table  11  is  created, 
and  next  by  combining  columns  in  it,  Table  12.  Ob¬ 
serve  that  by  combining  first  columns  and  next  rows, 
Tables  13,  14,  another  solution  is  found.  Observe  in 
Table  14  the  final  matrix  specifying  the  ternary  decom¬ 
position  operator.  Note,  the  decomposition  is  differ¬ 
ent  than  in  Table  12  because  row  labels  are  different. 
Because  of  a  don’t  care  in  the  operator  table,  it  corre¬ 
sponds  to  three  different  decomposition  operators;  with 
the  don’t  care  replaced  with  0,  with  1,  or  with  2.  Thus, 
several  solutions  may  be  found,  and  the  one  correspond- 
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ing  to  the  realizable  decomposition  operator  is  next  cho¬ 
sen. 

4  Experimental  Results  and  Conclu¬ 
sions 

Table  15  gives  results  of  bi-decomposition  of  some 
Benchmark  functions  from  POLO  directory  [4,  11,  16] 
(Data  Mining  and  Machine  Learning).  We  tried  decom¬ 
position  for  the  MIN  and  MAX  operator  by  successive 
removal  of  values  as  described  in  section  3  for  simple 
monotonic  functions.  That  is,  function  f(xa,  xbyxc)  = 
MIN/MAX(g(xa ,  xc)y  h(xb ,  arc)).  The  meaning  of  the 
columns  is:  Name  =  Name  of  the  Benchmark,  #/n  = 
Number  of  multi-valued  inputs,  log(In)  =  /o^2(  Product 
of  multiplicity  indices  of  inputs) ,  Out  =  multiplicity  in¬ 
dex  of  output,  #GJV  =  number  of  (multi-valued)  inputs 
of  g  for  MIN ,  #HN  =  number  of  (multi-valued)  inputs 
of  h  for  MIN ,  #GX  =  number  of  (multi-valued)  in¬ 
puts  of  g  for  MAXi  #HX  =  number  of  (multi-valued) 
inputs  of  h  for  MAX.  The  dashes  mean  that  there  is 
no  decomposition.  The  total  computation  time  was  10 
minutes  for  all  Benchmarks  of  the  table  on  a  133  MHz 
Pentium  Processor. 

We  generalized  the  binary  bi-decompositions  dis¬ 
cussed  previously  by  Steinbach  and  Sasao/Butler  for 
the  case  of  arbitrary  variable- valued  logic.  Al¬ 
though  bi-decomposition  is  only  a  special  case  that  can 
be  derived  from  the  general-purpose  Ashenhurst/Curtis 
decomposition,  this  decomposition  is  especially  impor¬ 
tant  in  Knowledge  Discovery,  Data  Mining,  and  auto¬ 
matic  knowledge  acquisition  applications,  because  it 
creates  networks  that  when  converted  to  sets 
of  rules,  are  easier  to  understand.  Moreover,  al¬ 
though  every  function  is  disjointly  or  non-disjointly  de¬ 
composable  this  way,  the  MV  bi-decompositions  are  es¬ 
pecially  efficient  and  effective  for  the  case  of  strongly 
unspecified  functions,  characteristic  for  Data  Mining. 
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Table  10.  Table  for  incomplete  function  /6  with 
labeled  rows  and  columns 
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Table  11.  Combined  rows  for  function  from  Ta¬ 
ble  10 
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Table  12.  Combined  columns  for  function  from 
Table  11,  the  final  matrix  specifying  the 
ternary  decomposition  operator 
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Abstract 

A  function  /  :  Pn  — ►  P,  P  —  {0, 1, . . .  ,p  -  1}  is  k- 
decomposable  iff  /  can  be  represented  as  f{X\,X 2)  = 
g{hi(Xi)M(Xi),...9hk(Xi)9X2),  where  (A1?A2)  is 
a  bipartition  of  input  variables.  This  paper  introduces 
the  notion  of  totally  fc-undecomposable  functions.  By 
using  this  concept,  we  can  drastically  reduce  the  search 
space  to  find  ^-decompositions.  A  systematic  method 
to  find  the  bipartitions  of  input  variables  that  will  not 
produce  any  ^-decompositions  is  presented.  By  com¬ 
bining  it  to  the  conventional  decomposition  methods, 
we  can  build  an  efficient  functional  decomposition  sys¬ 
tem.  This  method  is  promising  to  design  LUT-based 
FPGAs. 

Key  words:  Functional  decomposition,  Symmet¬ 
ric  function,  LUT-based  FPGA,  Multiple-valued  logic 
function. 

I  Introduction 

Decompositions  of  logic  functions  have  been  studied 
for  many  years.  Major  contributions  are  summarized 
as  follows: 

•  Formulations  using  decomposition  tables  [1,  5]. 

•  Formulations  using  compatibility  [12]. 

•  Fast  method  using  Jacobian  [20]. 

•  Applications  to  multi-level  PLA  networks  [13,  6]. 

•  Extension  to  incompletely  specified  functions  [21]. 

•  Computation  of  column  multiplicity  using  BDDs 
[14,8,4], 

•  Application  to  FPGAs  [11], 

•  Bi-decomposition  [15]. 

•  Fast  method  [2,  10]. 

•  Extension  to  multiple- valued  logic  [7,  9]. 

•  Extension  to  multiple-output  functions  [18,  22,  9]. 

In  the  above  contributions,  most  are  related  to  two¬ 
valued  functions.  However,  extensions  to  multiple¬ 
valued  functions  are  quite  natural. 

In  this  paper,  we  will  consider  decompositions 
shown  in  Fig.  1.1.  Given  a  multiple- valued  function 
/  :  Pn  P,  P  =  {0, 1, . . .  -  1},  we  will  consider  the 

problem  whether  /  can  be  represented  as  /(A 1,  A2)  = 
g(hi(Xi),h2(Xi)^..,hk(X1),X2),  or  not. 

Let  n  be  the  number  of  the  input  variables,  then 
we  have  to  consider  nearly  2n  different  bipartitions 
(Xi,X2)  of  the  input  variables  {xi,x2,.. .  When 


k 

H 

| 

— 

_ 

G 

x2 : 

«2 

f(XuX2)  =  g(h1(X1),  h2{Xx),  -  -  - ,  hk(Xx),X2) 

Figure  1.1:  Disjoint  ^-decomposition. 

n  is  large,  the  number  of  bipartitions  to  consider  is  too 
large,  and  the  exhaustive  search  is  impractical. 

This  paper  introduces  the  concept  of  totally  unde¬ 
composable  functions.  By  using  this  concept,  we  can 
drastically  reduce  computation  time  to  find  decompo¬ 
sitions.  This  paper  shows  a  systematic  method  to  find 
the  bipartitions  of  input  variables  that  will  not  produce 
any  decompositions.  By  combining  it  to  the  conven¬ 
tional  decomposition  methods,  we  can  build  an  efficient 
functional  decomposition  system. 

The  rest  of  this  paper  is  organized  as  follows:  Sec¬ 
tion  II  gives  definitions  and  basic  properties  of  func¬ 
tional  decompositions.  Section  III  introduces  the  con¬ 
cept  of  ^-undecomposable  functions.  It  also  derives  a 
theorem  to  find  bipartitions  (Ai,  A2)  that  will  not  pro¬ 
duce  any  ^-decomposition.  Section  IV  shows  a  method 
to  represent  a  set  of  bipartitions  by  using  a  switch¬ 
ing  function.  Section  V  enumerates  the  number  of 
&- undecomposable  functions.  It  also  shows  that,  for 
sufficiently  large  n,  almost  all  functions  are  totally  k - 
undecomposable. 

II  Definitions  and  Basic  Properties 
Definition  2.1  A  p-valued  function  is  a  mapping 
f  :  Pn  — ►  P,  where  P  =  (0, 1, . . .  ,p  -  1}  and  p  >  2.  If 
p  =  2,  /  is  a  switching  function. 

Definition  2*2  Let  the  set  of  the  input  variables  be 
{X}  =  {xi,x2,...,xn}.  (Xi,X2,...,Xr)  is  a  parti- 
tion  of  X  if  {At}  D  {Ay}  =  <f>  (1  <  i  <  j  <  n)  and 
{Ai}  U  {A2}  U  •••  U  {Ar}  =  {A}.  Especially  when 
r  =  2,  the  partition  is  a  bipartition.  The  number  of 
the  variables  in  {A}  is  denoted  by  |A|. 

Definition  2.3  A  p-valued  function  f  has  a  disjoint 
Ar-decomposition  iff  f  is  represented  as  /( Ai,  A2)  = 


0-7695-0161-3/99  $10.00  ©  1999  IEEE 


59 


Xl  =  (xi,X2 
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Figure  2.1:  Decomposition  table. 

g(hi(Xi),h2(X1),...,hk(Xi),X2)>  where  (Xi,X2)  is 
a  bipartition  of  X ,  and  g  and  hi  are  p-valued  functions. 
If  \Xi  \  >k+ 1  and  \X2\  >  1+  [logp  k],  then  the  decom¬ 
position  is  non-trivial,  and  f  is  fc-decomposable, 
where  \a ]  denotes  the  least  integer  not  smaller  than  a. 
We  also  assume  that  functions  with  up  to  two  variables 
are  decomposable .  {Xx}  and  {X2}  are  the  bound  set 
and  the  free  set,  respectively.  Variables  in  {Xx}  and 
{X2}  are  bound  variables  and  free  variables,  re¬ 
spectively.  When  f  is  k- decomposable,  f  is  realized  by 
the  network  shown  in  Fig.  1.1. 

Definition  2.4  If  f  does  not  depend  on  one  or  more 
variables ,  then  f  is  degenerate. 

Note  that  if  /  is  degenerate,  then  /  is  decomposable. 

Definition  2.5  Let  f(X)  be  a  p-valued  function ,  and 
(Xi,X2)  be  a  bipartition  of  X,  where  n\  =  |Xi|  and 
n2  =  |X2|.  The  decomposition  table  of  f  has  pUl 
columns  and  pn 2  rows ,  each  column  has  distinct  p-ary 
label  of  rii  digits,  each  row  has  distinct  p-ary  label  of 
n2  digits ,  and  the  corresponding  entry  of  the  table  rep¬ 
resents  the  value  of  f. 

Example  2.1  Let  f(X)  be  a  function  f  :  {0, 1,2}4  — ► 
{0,1,2},  and  (Xi,X2)  be  a  bipartition  of  X,  where 
Xi  =  (x1,x2)  and  X2  =  (a*,  34).  Fig.  2.1  is  an  ex¬ 
ample  of  a  decomposition  table.  ® 

Definition  2.6  The  number  of  different  column  pat¬ 
terns  in  the  decomposition  table  for  a  bipartition 
(Xi,X2)  is  the  column  multiplicity  and  is  denoted 
by  p(f  :X i,X2). 

Theorem  2.1  A  p-valued  function  /(X)  has  a  dis¬ 
joint  k-decomposition  /(X)  =  g{hi(Xi),h2(X\),. . . , 
hk(X1),X2)  iffli(f:X1,X2)<pk. 

The  size  of  decomposition  tables  for  n  variables 
is  pn ,  and  the  number  of  different  bipartitions  is 
0(2”).  Thus,  the  straightforward  method  to  find  a 
^-decomposition  is  impractical  for  the  functions  with 
many  inputs.  A  method  to  find  decompositions  by 
using  ROBDDs  (reduced  ordered  binary  decision  di¬ 
agrams)  or  ROMDDs  (reduced  ordered  multi-valued 
decision  diagrams)  has  been  developed. 


Figure  2.2:  Computation  of  column  multiplicity  p(/  : 
Xl5X2). 

Theorem  2.2  [14,  8,  7]  Let  (Xx,X2)  be  a  bipartition 
of  X.  Suppose  that  the  ROMDD  for  f(X)  is  parti¬ 
tioned  into  two  blocks  as  shown  in  Fig.  2.2 .  The  num¬ 
ber  of  nodes  in  the  lower  block  that  are  adjacent  to  the 
upper  block  is  equal  to  p(f  :  Xi,X2). 

Lemma  2.1  For  any  bipartition  (Xi,X2)  of  input 
variables  of  a  p-valued  function  f,  1  <  p(/  :  Xi,X2)  < 
min (pni,pp  2)?  where  nx  =  |Xx|  and  n2  =  |X2|. 

(Proof)  The  number  of  columns  in  the  decomposition 
table  is  p”1.  Thus,  we  have  p(/  :  Xx,X2)  <  p”1.  The 
number  of  different  functions  of  n2  variables  is  pp  2 . 
Since  each  column  of  the  decomposition  table  shows 
an  n2 -variable  function,  we  have  p(/  :  Xx,X2)  <  p?  2 . 

□ 

Definition  2.7  Let  /(Xa,Xb)  be  a  function,  where 
\X&  |  =  n^.  Let  as  €  PnB  be  an  assignment  for  Xb - 
Then,  /(X^,a^)  denotes  the  sub-function,  where  the 
values  of  Xb  are  fixed  to  the  constants  Sb  *  /(oa>X jg) 
is  similarly  defined. 

Definition  2.8  Let  X  =  (xi,x2,...,xn)  and  a  = 
(ai,a2,...,a„).  Then, 

^(i  (  —  p  1  if  =:  foT  *  ==  lj  2, . . . ,  n. 

{  =  0  otherwise. 

Lemma  2.2  If  k  >  p\x*\,  then  any  p-valued  function 
is  realized  in  the  network  shown  in  Fig.  1.1. 

(Proof)  Let  n2  =  |X2|.  An  arbitrary  func¬ 
tion  /(Xi,X2)  is  represented  by  /(Xx,X2)  = 

Vaepn 2  /(Xl5  a)X2®,  where  P  -  {0, 1, . . .  ,p- 1}.  Since 
the  number  of  products  in  the  above  expression  is  at 
most  p”2,  we  have  the  lemma.  E 

Example  2*2  When  n2  =  1  and  k  =  p,  any  p-valued 
function  is  realized  in  the  network  shown  in  Fig.  2.3  by 
using  the  following  expansion: 

f(XuX2)  =  x2°  /o  (Xl )  VX2J  /1  (Xl )  V-  •  •  V*r 1  fp-i(Xi). 
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Figure  2.3:  Example  of  a  trivial  ^-decomposition. 


X\  =  (ari,X2,X3) 


Figure  3.1:  Totally  2-undecomposable  function. 


We  assume  that  the  ^-decomposition  in  Lemma  2.2  is 
trivial.  This  is  why  we  assumed  that  n2  >  [logp  fe]  +  1 
in  Definition  2.3. 

Ill  fc-Undecomposable  Functions 

In  this  part,  we  introduce  the  notion  of  to¬ 
tally  &-undecomposable  functions.  We  will  show 
that  if  f(XA,  3b)  is  totally  k- undecomp osable,  then 
/(X^,  Xb)  is  Ar-undecomposable  for  many  bipartitions. 
Thus,  by  finding  totally  fc-undecomposable  subfunc¬ 
tions,  we  can  drastically  reduce  the  search  space  for 
functional  decompositions. 

Definition  3.1  A  p-valued  function  f(X)  is  totally 
A>undecomposable  if  p(f  :  Xi,X2)  >  pk  for  any 
bipartition  (Xi,  X2),  where  \Xi\  >  1  +  k  and  \X2\  > 

1 + ri°gP  • 

Example  3.1  Consider  the  case  where  n  =  3,  p  =  2, 
andk  =  1.  f(xi,  x2,  £3)  =  X\X2  Vx2x3  Vx3ari  is  totally 
l-undecomposable.  ■ 

Example  3.2  Consider  the  case  where  n  =  5,  p  =  2, 
and  k  =  2.  A  five-variable  function  f  shown  in  Fig.  3.1 
is  totally  2-undecomposable,  since  p(f  :  Xi,X2)  >  4 
for  any  bipartitions  with  |Xi|  =  3  and  \X2  \  =  2.  ■ 

Lemma  3.1  Let  ( X\a,X\b,X2a,X2b )  be  a  partition 
of  X ,  where  \Xia\  >  k  +  1  and  \X2a\  >  1  + 
[logpfc].  Let  d\B  and  a2 b  be  assignments  of  Xib  and 
X2b,  respectively.  If  f(XiA,  5ib,X2a,  <?2b)  has  no  k- 
decomposition  of  the  form 

f(XiA,X2A)  =  f(XiA,diB,X2A,a2B) 

=  g(hi  (X1 A),  h2  (Xu), . . . ,  hk(XlA),  X2A), 

then,  f(XiAiXiB,X2A,X2B)  has  no  k- decomposition 
of  the  form 

f(XiAi  X\ B,  X2A ,  X2b) 


=  g(hi (Xi At  X\b ),  h2(XlA, Xi B), . . . , 

^1b),  X2a,X2B)- 

In  this  case,  {Xib}  or  {X2b}  can  be  empty  set(s). 

(Proof)  Assume  that  /  has  a  &- decomposition  of  the 
form 

/(XiA,  X'lB,  X2A>  X2 b) 

=  g(hi  (Xi  a,  Xib),  h2  (Xja,  Xib),  -  ■  • , 
hk{X\A,  Xib),  X2  a,  X2b)* 

Assign  aiB  and  S2b  to  Xib  and  X2b 5  respectively. 
Then,  we  have  the  decomposition  /(Xia,  gib,^2A, 
a2 b)  =  g{h\{X\A,  aiB),h2(XiA,aiB), .. .  ,hk(XiA, 

diB),X2A,d2B)-  However,  this  contradicts  the  as¬ 
sumption  of  the  lemma.  □ 

Example  3.3  Consider  the  case  where  n  =  S,  k  = 
2,  and  p  =  2.  Let  f(x ux2,...,x$)  be  an  8- 
variable  function.  If  (x\,  x2,x%,  0, 1,  #6,  X7, 1)  Aas  no 
2 -decomposition  of  the  form  f(x \,x2,x$,x§,xi)  = 
£2, a?3), ^(ffi,  a2, *3)? *£7)5  then  f(xux2, 
...,x$)  has  no  2- decomposition  of  the  form  f  = 
g{h\(x\,  x2 ,  #3,  #4, 375),  ^2(^15  372,  373,  374,375),  376  7  377,  3*). 
In  this  example,  X\a  =  (xi,  372,x3),  Aib  =  (374,275), 
X2A  =  (xq,Xj),  X2b  =  (378)7  «1B  =  (0,1)7 
a2B  =0.  ® 

Theorem  3.1  Letf  (XajX#)  be  a  partition  of  X, 
where  \Xa\  >  A;  4-  [logp  k~\  +  2  and  |Xb|  >  1.  For  an  as¬ 
signment  a,B,  if  f(X a,  03)  is  totally  k-undecompo sable, 
then  f  has  no  decomposition  of  the  form  f(X i,X2)  = 
g(hi(Xx),h2(Xi),. . .  ,hk(Xi),X2),  where  (Xi,X2)  is 
a  bipartition  of  X,  \{Xa}  H  {Xi}|  >  k  +  1,  and 
\{XA}  PI  {X2}|  >  1  +  [logp  *1. 

(Proof)  Let  XA  =  ( X\a,X2a )  and  XB  =  (Xib,  X2B)- 
Then,  apply  Lemma  3.1,  and  we  have  the  theorem.  □ 

Definition  3.2  Let  (Xi,X2)  be  a  bipartition  of 
{xi,x2,...,xn},  where  X 1  =  {x\,x2,. . .  ,xr)  andX2  = 
(xr+i,:rr+2,...,a7n).  Such  a  bipartition  is  compactly 
denoted  by  the  bipartition  of  integers  (1, 2,  + 

l,r  +  2,...,n). 

Example  3.4  Let  f(xi,x2,x^,x4,xs)  be  a  five-vari¬ 
able  two-valued  function.  If  f(xi,x2,xz,  0,0)  is  totally 
l-undecomposable,  then  f  is  l-undecomposable  for  the 
following  12  bipartitions: 

(1)  2|3,4, 5),  (1,2,4|3, 5),  (1,2,5|3,4),  (1,2,4,5|3), 
(1)3|2,4, 5),  (1,3,4|2, 5),  (1,3,5|2,4),  (1,3,4,5|2), 

(2, 3|1,4, 5),  (2,3,4|1, 5),  (2,3,5|1,4),  (2,3,4,5|1). 

I 
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Theorem  3.2  Consider  a  p-valued  function 
f(XA,Xs),  where  ha  =  \Xa\  >  k  +  [logpfc]  +  2  and 
ns  =  \Xb\  >  1.  For  an  assignment  as,  if  /(Xa,  a b)  is 
totally  k-undecomposable,  then  f  is  k-undecompo sable 
for 


a  switching  function  of  n  variables.  In  this  part,  we 
will  introduce  such  representations.  Also,  bipartitions 
that  will  not  produce  decompositions  are  compactly 
denoted  by  symmetric  functions.  We  also  introduce 
notations  for  symmetric  functions. 


a(nA,nB,p,  k)  = 


nA-l-riogpfcl 

^2  C(nA,i ) 

i=fc+l 


2"B 


Definition  4.1  A  function  f  is  a  totally  symmetric 
function  if  any  permutation  of  the  variables  in  f  does 
not  change  the  function . 


bipartitions. 

(Proof)  F  is  &-undecomposable  when  the  following 
conditions  are  satisfied: 

1)  More  than  k  variables  in  { XA }  are  included  as 
bound  variables. 

2)  More  than  [logp  fc]  variables  in  {XA}  are  included 
as  free  variables. 

3)  Variables  in  {Xb}  can  be  either  in  the  bound  set 
or  the  free  set. 

From  1)  and  2),  we  have  the  first  factor.  And,  from  3), 
we  have  the  second  factor.  O 

Example  3.5  Let  f(x i,x2,x3,x4,X5)  be  a  five-vari¬ 
able  2-valued  function.  If  f{x  1,  #2,  £3, 0,0)  w  totally 
1-undecomposable ,  then  f  is  1-undecomposable  for  a  = 
12  bipartitions,  since  k  =  1,  p  =  2,  n\  =  3,  and  n2  =  2. 
This  is  also  verified  by  Example  3.4  * 


Corollary  3.1  Consider  an  n -variable  function 
f(XA,XB),  where  nA  =  \XA\  >  k  +  |"logp  fc]  +  2?  and 
ub  =  \Xb I  >  1-  For  an  assignment  3b,  if  ( XA ,  3b)  is 
totally  k-undecomposable,  then  we  have  to  check  for  at 
most 


P(nA,nB,P,k)  = 


bipartitions. 

(Proof)  There  are  2n  =  2nA2nB  bipartitions. 
Among  them,  ol{ua,ub,P,  k)  bipartitions  are  k- 
undecomposable.  So,  we  have  to  check  at  most 
P(nA,ns,P,k)  =  2n  -  Oi(nA,nB,P,k)  bipartitions.  □ 

Example  3.6  Corollary  3.1  shows  that  when  p  =  2 
and  k  —  1,  the  fraction  of  (5  to  2”  is  7  =  - 

Therefore,  when  nA  =  3,  7  =  5/8;  when  nA  —  4, 
7  =  3/8;  when  nA  =  5,  7  =  7/32;  and  when  nA  =  6, 
7  =  1/8.  ■ 

IV  Switching  Function  Representing 
Set  of  Bipartitions 

Functional  decomposition  is  to  find  a  bipaxtition 
(XUX2)  such  that  f(XuX2)  =  g(hl(Xl),h2(X1),..., 
hk{X i),X2).  There  are  2n  different  bipartitions  in¬ 
cluding  trivial  ones,  and  these  can  be  represented  by 


Definition  4.2  The  elementary  symmetric  func¬ 
tions  ofn  variables  are 

Sq  =  X\X2  *  *  *  Xn, 

S?  =  X\X2  *  *  *  Xn  V  '  *  *  Xn  V  *  •  •  V  X1X2  *  *  *  Xn—l'En, 

^  and 

5”=x  1x2***x„. 

S”  —  1  iff  exactly  i  inputs  are  equal  to  one.  Let  A  C 
{0, 1,  A  symmetric  function  is  defined  as 

follows: 

53  =  v  s?. 

ieA 

Example  4.1  /(xi,x2,x3)  =  xix2x3  V  xix2x3  V 

5ix2x3  VX1X2X3  is  a  totally  symmetric  function,  f  =  1 
when  all  the  variables  are  one,  or  when  only  one  vari¬ 
able  is  one.  Thus,  f  can  be  written  as  Sf  VSf  =  ^{1,3}* 

I 

Definition  4.3  A  set  of  bipartitions  of  the  input  vari¬ 
ables  {xi,x2,.  .  .,xn}  is  represented  by  a  switching 
function  bp  of  n  variables.  In  bp,  x2-  =  1  denotes  that 
X{  is  in  the  bound  set,  and  x2  =  0  denotes  that  x2-  is 
in  the  free  set.  The  number  of  true  minterms  of  bp  is 
denoted  by  \bp\. 

Example  4.2  Suppose  that  n  =  5.  The  minterm 
Xix2x3 X4X5  denotes  that  x\,  x2,  and  x3  are  in  the 
bound  set,  and  x4  and  X5  are  in  the  free  set.  ■ 

Lemma  4.1  The  set  of  bipartitions  for  trivial  k- 
decompositions  for  n -variable  p-valued  function  is  given 
by 

=  £{0,1, V  £{n-flogp 

(Proof)  When  the  number  of  variables  in  the  bound 
set  is  less  than  k  + 1,  then  it  is  a  trivial  decomposition. 
To  be  non-trivial  ^-decomposition,  at  least  1 +  flogp  fc] 
variables  must  be  in  the  free  set.  So,  if  the  number 
of  variables  in  the  bound  set  is  greater  than  n  —  1  — 
[logp  fc],  then  it  is  a  trivial  decomposition.  □ 

Example  4.3  Let  n  =  10,  p  =  2,  and  k  =  2,  then  the 
set  of  bipartitions  for  trivial  k- decompositions  is  given 
by  «o  =  £|o  1  2  9  10}  *  This  is  explained  as  follows:  If 
the  number  of  bound  variables  is  two  or  smaller,  then 
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the  decomposition  is  trivial,  since  the  module  for  H  has 
two  outputs.  If  the  number  of  variables  in  the  bound 
set  is  9  or  10,  then  the  number  of  free  variables  is  one 
or  zero.  By  Definition  2.3 ,  this  also  corresponds  to 
a  trivial  decomposition.  Thus,  the  number  of  trivial 
decompositions  is  given  by 

|«o|  =  C(10,0)  +  C(10)1)  +  C(105  2) 

+C(10, 9)  +  (7(10,10)  =  67. 

The  set  of  non-trivial  bipartitions  is  given  by  uq  = 
cio  I 

°{3,4,5,6,7,8}' 

Theorem  4.1  Let  f(X)  be  a  p-valued  function,  and 
(XA,XB)  be  a  partition  of  X.  If  f(XA,aB)  — 
f{x  1,472) "•?#?*)  is  totally  k-undecompo sable,  then  f 
has  no  k-decomposition  for  the  bipartitions 

u  =  ^{fc+l,ik+2,...,r-l-riogp  •  •  •  i  Xr)- 

(Proof)  By  Theorem  3.1  and  Lemma  4.1,  /  is  k- 
undecomposable  for  these  bipartitions.  □ 

Example  4.4  Consider  the  case  where  p  =  2,  n  =  5, 
k  =  1  and  r  =  3.  If  f(x i,x2,ar3,0,0)  is  totally  1- 
undecompo sable,  then  f  is  undecompo sable  for  the  bi¬ 
partitions 

u  =  5|(a?i,  #2,  £3)  =  X1X2X3  V  X1X2X3  V  £i£2£3* 

Note  that  u  denotes  the  same  set  of  bipartitions  as  Ex¬ 
ample  3.4.  * 

Example  4.5  Suppose  that  we  have  to  check  whether 
the  given  2-valued  10-variable  function  f(x  1,^2,..., 
xio)  can  be  realized  by  a  network  shown  in  Fig.  4-1- 
In  this  case,  the  straightforward  method  needs  to  check 
all  possible  bipartitions  (X\,X2),  where  \Xi\  =  5  and 
\X2 1  =  5.  This  set  of  bipartitions  is  represented  by 
55°(xi,X2,.  . .  ,£io),  and  the  number  of  bipartitions  to 
consider  is  |*S,5°(a:i,X2, . . .  ,^io)|  =  £7(10,5)  =  252. 
However,  if  f{x\,X2,x^,  x4,  X5,0,0, 0, 0, 0)  is  totally  2- 
undecompo sable,  then  we  need  not  check  for  (7(5,3)  x 
C( 5, 2)  =  100  bipartitions.  This  fact  is  explained  as 
follows:  From  Theorem  4-1,  the  set  of  bipartitions  that 
will  not  produce  2 -decomposition  is  given  by 

Ui  =  Sl(x1,X2,X3,Xi,Xb)S2(xe,X1,XS,Xg,Xio). 

In  u\,  the  first  factor  selects  three  variables  from 
{#1?  #2)^3?  £4,  #5}  as  bound  variables,  and  the  second 
factor  selects  two  variables  from  {a^)  £7)  £8)  £9)  £10} 
as  bound  variables.  For  example,  suppose  that 
{Xi}  =  {#i, £25^35^6)^7}  is  selected  as  a  bound 
set,  and  {X2}  =  {£4, £5, £8) £9, £10}  is  selected 

as  a  free  set.  This  bipartition  (X\,X2)  does 
not  produce  2-decomposition,  since  {x\,X2^x^}  is 
in  the  bound  set  and  {£4  5  £5}  is  in  the  free 
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f(XuX2)  =  g(h1(Xl),h2(Xl),X2) 

Figure  4.1:  2- decomposition  of  10- variable  function. 

set,  and  f(xi,  #2,  £3)  £4  ,£5 ,0,0, 0,0, 0)  is  totally  2- 
undecompo sable.  Note  that  |«i|  =  (7(5, 3)C(5, 2)  = 
10  x  10  =  100. 

In  a  similar  way,  if  f( 0, 0,0, 0, 0,  £7)£8?  £9)  £10) 
is  also  totally  2-undecomposable,  then  the  following  bi¬ 
partitions  need  not  be  checked : 

U2  =  (£6)  £7) £8)  £9?  £10)^2  (*^1 )  *^2)  £3)  £4?  £5)* 

U2  denotes  C( 5,3)  x  (7(5,2)  =  100  bipartitions  for  that 
no  2 -decomposition  exist.  So,  we  need  only  to  check  for 
the  following  bipartitions: 

bp  =  5,5°(a;i,a:2,...,a;io)Mi«2- 

Since  u\  and  U2  are  mutually  disjoint,  we  have  only  to 
check 

\bp\  =  |S5°(a;i,X2,...,a:io)|  ~  lwi|  “  l«2| 

=  £(10,5)- 100 -  100  =  52 

bipartitions.  In  this  case,  we  can  reduce  the  search 
space  into  one  fifth  by  finding  two  subfunctions  that 
are  2-undecomposable.  i 


Theorem  4.2  Let  f(x\,X2, . . .  ,£n-i) «)  be  totally  k- 
undecomposable,  where  a  E  P.  Then,  /(£i,£2)«--) 
xn-i,xn )  is  totally  k-undecompo  sable  iff  f  is  undecom- 
posablefor  the  following  C(n  —  1,  [logp  k~\)+C(n  —  l,fc) 

bipartitions:  “-l-Pag*  (£1,  £2,  -  ••  ,£n-i)£n  V 

Sfc  1  (£l5  £2)  •  •  •  5  £n— l)£n» 

Example  4.6  Consider  the  case  where  n  =  6,  p  =  2, 
and  k  =  2.  Suppose  that  f(x  1,2:2,  £35  £4)  £5)0)  is 
totally  k-undecomposable .  To  show  that  f  is  totally 
undecomposable,  we  need  the  followings:  For  x&  = 
0,  we  have  to  check  for  the  bipartitions  denoted  by 
$4  (£1 ,  £2 )  £3 ,  £4 )  £5 )£6  •  Fig-  4Ma)  sh°ws  an  example 
of  C( 5,4)  bipartitions .  For  xq  =  1,  we  have  to  check 
for  bipartitions  denoted  by  S\{x\,X2,x^,x^,x^)x^. 
Fig.  4'%0})  shows  an  example  of  C( 5,2)  =  10  bipar¬ 
titions.  ■ 
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Figure  4.2: 

V  Number  of  Totally 

A>Undecomposable  Functions 

When  n  is  sufficiently  large,  almost  all  functions 
are  totally  1-undecomposable  [19].  In  this  part,  we 
will  show  that  almost  all  functions  are  also  totally  k- 
undecomposable. 

5.1  p-valued  case 

Lemma  5.1  Let  N<i(n,p,k)  be  the  number  of  n- 
variable  p-valued  k- decomposable  functions.  Then , 


where  A{ni)  =  kpni  +pn2+*.  Note  that  A(ni)  takes  its 
maximum  when  n\  =  k  +  1  and  n\  =  n  —  1  —  [logp  fc], 
and  the  values  of  A(ni)  are  kpk+l  +  p*1"1  and  pn~l  + 
&pfc+1,  respectively.  Thus,  A[n\)  <  pn~l  +  C,  where  C 
does  not  depend  on  n.  So,  (5.3)  is  less  than 

D(n)  =  2 ”(»  -  1  -  flogp  *1  -  (k  +  1  ))ppn  1+0 . 

Let  us  take  the  logarithm  of  D(n ),  and  we  have 
lo gpD(n)  =  nlogp2+\ogp(n-k-\\ogp  fcl-2)+pn-1+(7. 

Since  log^-—  — ►  ^  as  n  — ►  oo,  we  can  conclude  that 
(5.1)  holcfs.  □ 

5.2  Two-valued  case 

When  n  >  4  and  p  =  2,  most  functions  are  totally 
1-undecomposable.  For  k  =  1,  we  obtained  the  values 
of  Nud(n,p,  k)  by  exhaustive  enumeration: 


n -1-  flogp  Jr] 

Nd(n,p,k)<  C(n,n1)pkPni+Pn~ni+h. 

riizzk+l 


(Proof)  Suppose  that  a  function  /  has  a  disjoint  k- 
decomposition  shown  in  Fig.  1.1.  First,  consider  the 
module  H .  The  number  of  ways  to  select  bound  vari¬ 
ables  is  C(n,  ni).  Since  H  has  n\  inputs  and  k  out¬ 
puts,  the  number  of  functions  for  H  is  pkp  1 .  Next, 
consider  the  module  G.  Since  G  has  n2  +  k  inputs 
and  single  output,  the  number  of  functions  for  G  is 
pPn2+k  =  pPn“ni+\  Hence,  we  have  the  lemma.  □ 


Theorem  5.1  Let  NU(i(n,p,k)  be  the  number  of  n - 
variable  p-valued  totally  k-undecompo sable  functions. 
Then , 


NudjrhPi  k) 

pP” 


1  as  n  — *  oo. 


(Proof)  Since  Nud(n,p,k)  -1-  iV<i(n,p,  k)  =  pp  ,  we  will 
show  that 


JVd(ra,p,  fc) 
Pp " 


0  as  n  — ►  oo. 


(5.1) 


From  Lemma  5.1,  we  have 


n  -1-  flogp  *1 

Nd(n,p,k)<  £  C(n,ni)pkp"1+pn2+k .  (5.2) 

Since  C(n,ni)  <  2n,  the  right-hand-side  of  (5.2)  is  less 
than 

n-l-flogpfc]  n  —  1  — flogp  fc] 

2n  PkP"+Pn*+k  =  2n  pA(ni)' 

m=k+l  ni  =  fc+l 

(5.3) 


Nud(  3,2, 1)  =  104, 

7V„d(4, 2, 1)  =  57, 240,  and 
Nud( 5,2, 1)  =  4, 290, 002, 448. 

When  n  =  5,  there  are  22’  =  232  =  4,294,967,296 
functions.  Thus,  99.9%  of  the  functions  are  totally 
1-undecomposable.  The  case  of  k  =  2  is  interesting, 
since  some  FPGAs  have  LUTs  with  two  outputs  [3]. 
The  decompositions  must  satisfy  the  relation: 

|*i |  >  k  +  1  and  \X2\  >  1  +  pog2  *1. 

This  requires  that  \X\  =  \X\  |  -f-  |X2 1  >  k  -f-  2  +  [log2  fc] . 
Thus,  when  k  =  2,  only  the  functions  with  n  >  5  are 
interesting.  For  n  =  5,  the  only  case  is  ni  =  3  and 
n2  =  2,  and  we  have  iVud(5,2,2)  =  3,744,402,432. 
Thus,  87.2%  of  the  5- variable  functions  are  totally  2- 
undecomposable. 

VI  Conclusion  and  Comments 

In  this  paper,  we  defined  totally  &-undecomposable 
logic  functions,  and  showed  a  systematic  method  to 
find  a  set  of  bipaxtitions  that  will  not  produce  disjoint 
^-decompositions.  Key  contributions  are: 

1)  Generation  of  a  set  of  A;-undecomposable  bipar¬ 
titions  from  totally  fc-undecomposable  subfunc¬ 
tions. 

2)  Representation  of  fc-undecomposable  bipartitions 
by  an  n- variable  switching  function. 

3)  Enumeration  of  totally  fc-undecomposable  func¬ 
tions. 

The  presented  method  can  be  extended  to  the  case  of 
incompletely  specified  functions.  This  method  can  be 
combined  to  existing  decomposition  methods  to  reduce 
search  space. 
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When  n  =  3  or  4,  p  =  2  and  fc  =  1,  totally  fc- 
undecomposable  functions  are  easily  detected  by  BDDs 
and  look-up  tables  [17].  By  using  this  method,  we 
can  show  the  undecomposability  of  randomly  gener¬ 
ated  functions  very  quickly. 

We  decomposed  more  than  four  thousand  bench¬ 
mark  functions  including  functions  with  256  inputs  and 
245  outputs  [16,  17].  Experimental  results  for  p  =  2 
and  k  =  1  show  that  for  1-undecomposable  functions, 
the  computation  time  were  reduced  to  up  to  one  hun¬ 
dreds.  Currently,  we  are  developing  a  system  for  k- 
decompositions  with  k  =  2. 

Even  if  the  given  functions  have  two- valued  inputs 
only,  functional  decompositions  with  multi-valued  in¬ 
puts  seems  to  be  useful.  This  is  explained  as  fol¬ 
lows:  Suppose  that  a  completely  specified  two-valued 
input  function  has  a  fc-decomposition  of  the  form 
f(XuX2)  =  g(h1(Xl),g2(Xl),...,gk(Xl),X2),  and 
that  fi(f  :  X\,X2)  <  2fc.  In  this  case,  assigning 
fi(f  :  Xi,X2)  different  binary  vectors  to  the  k  out¬ 
puts  of  H  produces  don’t  care  conditions  for  function  g . 
This  makes  decomposition  problem  very  difficult  [21]. 
However,  if  we  do  not  assign  the  binary  vectors  to  the 
output  of  h ,  but  assume  that  h  produces  a  multiple- 
valued  output,  then  no  don’t  cares  are  generated.  In 
this  case,  the  decomposition  problem  is  easier. 
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Abstract 

Shestakov  expresses  an  incompletely  specified  Boolean 
function  f(x  i, . . .  ,xn)  in  terms  of  Boolean  functions  gu , 
gv  and  h  in  the  form  h(gu(ui, . . . , ur),gv(vi, . . . ,  t>«)), 
where  {ui, . . .  ,ur}  U  {vi, . . . ,  vs}  =  {xi, . . .  ,xn}.  We 
generalize  his  method  to  multi-valued  functions  with  par¬ 
tial  don't  care's  represented  in  a  compact  cube-like  nota¬ 
tion;  we  do  this  using  blankets ,  which  are  generalizations 
of  set  systems.  Luba  and  Selvaraj  express  a  Boolean  func¬ 
tion  f(x  i, . . .  ,xn)  in  terms  of  Boolean  functions  g  and  h 
as  h{uu  •  •  - ,  ur,g(vu  •  • This  method  has  been  for¬ 
malized  using  blankets  by  Brzozowski  and  Luba,  and  gen¬ 
eralized  to  multi-valued  functions  by  Brzozowski  and  Lou. 
The  relations  among  these  methods  are  discussed. 


1  Introduction 

Decomposition  of  Boolean  and  multi-valued  functions 
has  been  studied  by  many  authors  since  the  1950’s;  we  re¬ 
fer  the  reader  to  [2,  7],  where  various  approaches  to  de¬ 
composition  are  discussed.  The  present  paper  is  concerned 
mainly  with  the  method  of  Shestakov  [8,  9],  its  relation  to 
the  techniques  of  Luba  and  Selvaraj  [6]  and  its  generaliza¬ 
tions  [1,  3,  5]. 

Luba  and  Selvaraj  developed  an  approach  to  Boolean 
function  decomposition  using  set  systems  and  their  gener¬ 
alizations  [6].  They  express  a  function  /(x)  in  the  form 
h(u,g(v))>  where  x  =  (xi, . . . ,xn),  u  =  (ui,...,ur), 
v  =  (vi, . . . ,  vs),  X  =  {xi, . . .  ,x„},  U  =  {ui, . . .  ,«r}, 
V  =  {vi, . . . ,  v5},  U  U  V  =  X,  and  g  and  h  are  Boolean 
functions.  Very  similar  methods  were  discovered  indepen¬ 
dently  by  Shestakov  [8,  9],  who  expresses  a  Boolean  func¬ 
tion  /(x)  in  the  form  h(gu(u),gv{v)),  where  u ,  v,  and  x 
are  as  above,  and  pu,  gv,  and  h  are  Boolean  functions.  The 
methods  of  Luba  and  Selvaraj  [5,  6]  have  been  formalized 
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by  Brzozowski  and  Luba  [2],  and  extended  to  the  multi¬ 
valued  case  by  Brzozowski  and  Lou  [1].  In  this  paper, 
we  formalize  Shestakov’s  method  using  blanket  algebra  [2], 
and  extend  it  to  the  multi-valued  case. 

2  Set  Functions  and  Blankets 

A  vector  of  distinct  variables  is  denoted  by  a  letter  in 
lower  case,  and  the  corresponding  set  of  variables,  by  the 
same  letter  in  upper  case.  Thus,  if  s  =  is 

a  vector  of  variables,  then  S  =  {si, . . . ,  sn}  is  the  corre¬ 
sponding  set.  Also,  if  S  =  {si, . . . ,  sn}  is  a  set  of  variables 
explicitly  represented  in  that  order,  then  $  =  ($i , . . . ,  sn)  is 
the  corresponding  vector.  Suppose  T  =  . . . ,  s*r }  is  a 

subset  of  S  with  <i  <  *2  <  •  •  •  <  and  t  =  . . . ,  sir) 

is  the  corresponding  vector.  If  b  =  (61, . . . ,  bn)  is  any  vec¬ 
tor  of  values,  then  bl  is  the  projection  of  b  to  the  variables 
in  t9  i.e.,  bl  =  (bix , . . . ,  bir ).  Vectors  of  values  are  written 
without  parentheses  and  commas.  For  example,  suppose 
s  =  ($i, $2, s35 $4)*  t  =  (sx,s3,s4)  and  b  =  (4, 1,2,0); 
then  b 1  =  (4,2,0).  In  simplified  notation  b  =  4120  and 
b*  =  420. 

Let  E  be  a  finite  nonempty  set,  the  set  of  possible  values 
for  each  variable,  and  let  D  =  2E  —  {0}.  The  set  D  is 
partially  ordered  by  set  inclusion.  For  k  >  0,  an  element 
of  Dk  is  called  a  set  vector.  The  inclusion  partial  order  is 
extended  to  Dk .  For  t ,  tf  €  Dk , 

t  C  tf  if  and  only  if  U  C  t\  for  alH,  1  <  i  <  k. 

The  intersection  of  two  elements  t  and  tf  from  Dk  is  the 
component-by-component  set  intersection.  This  intersec¬ 
tion  t  fl  tf  =  (£1  n  . . . ,  tk  n  t’k)  is  empty  if  one  of  its 
components  is  empty;  otherwise,  it  is  an  element  of  Dk. 

We  use  a  compact  matrix  notation  to  represent  multi¬ 
valued  functions.  A  px(n-fm)  set  matrix  F  is  a  matrix 
with  elements  from  D.  We  associate  a  vector  x  of  input 
variables  with  the  first  n  columns  of  the  matrix;  also  X  is 
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the  corresponding  set  of  input  variables.  Similarly,  y  and  Y 
are  the  vector  and  set  of  output  variables  associated  with  the 
last  m  columns  of  F.  Each  row  t  of  a  set  matrix  is  a  set  vec¬ 
tor  of  length  n  4-  m,  where  the  first  n  components,  denoted 
by  tx ,  are  input  values  and  the  last  m  components,  denoted 
by  ty ,  are  output  values.  By  convention,  to  simplify  matrix 
F  we  leave  out  any  row  which  has  ( E ,  . . . ,  F)  as  its  output. 

An  example  of  a  set  matrix  is  given  in  Table  1,  where 
E  —  {0, 1, 2},  p  =  8,  n  =  5,  and  m  =  2.  For  simplicity, 
curly  brackets  have  been  omitted  from  the  matrix  entries. 

Table  1.  Set  matrix  F 


Row 

Xi 

X2 

£4 

yi 

2/2 

1 

1 

0,1 

0 

0 

0,1 

1 

0 

2 

1,2 

1 

0 

1,2 

2 

0 

1,2 

3 

0,1 

0 

0 

0,1 

0 

1,2 

0 

4 

2 

0,2 

1,2 

2 

2 

0,1 

2 

5 

0,1 

0 

0,2 

1 

1 

2 

1,2 

6 

0 

0 

0,1 

0 

2 

1 

1 

7 

0,1,2 

1 

0,2 

0 

2 

0 

0 

8 

0 

2 

0 

0 

2 

1 

2 

In  the  binary  case,  where  E  =  {0, 1},  a  row  vector  t 
of  F  represents  a  cube,  since  its  entries  are  {0}s,  {l}s, 
and  {0,  l}s,  representing  logic  Os,  Is,  and  “don’t  cares,” 
respectively.  This  is  the  well-known /r  format,  used  in  the 
program  ESPRESSO.  This  notation  is  also  extended  to  the 
multi-valued  case,  where  each  entry  is  (a)  a  singleton  set  (a 
particular  value),  (b)  the  set  E  of  all  possible  domain  values 
(the  complete  don’t  care),  or  (c)  a  nonempty  proper  subset 
of  E  (a  partial  don’t  care).  Borrowing  the  terminology  from 
the  binary  case,  we  refer  to  an  element  e  =  (ei , . . . ,  en)  of 
En  as  a  minterm.  On  the  other  hand,  a  set  vector  of  the  form 
b  =  ({ei}, . . . ,  {en})  £  Dn ,  is  a  singleton  vector.  The  set 
of  all  such  singleton  vectors  is  denoted  by  £n.  By  a  slight 
abuse  of  terminology,  we  also  refer  to  singleton  vectors  as 
minterms.  Furthermore,  we  refer  to  the  set  of  rows  of  a  set 
matrix  F  also  as  F,  and  we  write  t  £  F  if  £  is  a  row  of  F. 
A  minterm  b  £  £n  is  involved  in  a  row  t  £  F,  if  tx  D 
We  call  b  relevant  to  f  if  b  is  involved  in  some  row  t  £  F. 

A  set  matrix  is  said  to  be  consistent  if,  for  any  set  T  of 
rows  of  F, 

implies  Q  ty  ^  0. 
ter  t£T 

Consistency  means  that,  if  the  input  projections  of  the  rows 
in  T  contain  a  common  minterm  6,  then  the  output  projec¬ 
tions  of  that  set  of  rows  also  contain  a  common  minterm. 
From  now  on  we  consider  only  consistent  set  matrices . 

One  verifies  that  the  set  matrix  of  Table  1  is  consistent. 


Only  Rows  1  and  3  share  a  common  input  minterm  10000. 
These  rows  share  the  output  minterm  10. 

We  denote  by  FaOfe  =  {t  £  F  \  tx  D  b}  the  set  of  all 
rows  of  F  involving  a  given  minterm  b .  For  example,  for 
the  matrix  of  Table  1,  Fx3ioooo  =  {1,3}. 

The  interpretation  of  a  (consistent)  set  matrix  F  is  as  fol¬ 
lows.  It  defines  a  multi-valued  function,  called  set  function, 
f  :  £n  -¥  Dm.  For  any  singleton  vector  b  £  £n, 

f(b)=  p|  i»,  if  Fxob  ±  0 

teFxib 

and  f(b)  ~  (F, . . . ,  E)  (i.e.,  m  Fs — the  complete  don’t 
care)  otherwise.  We  find  it  convenient  to  refer  to  the  vec¬ 
tor  of  variables  of  /  as  the  input  of  /,  and  to  the  vector  of 
function  values  as  the  output  of  /. 

For  example,  for  the  matrix  of  Table  1,  /( 10000)  = 

({!},  {0})  n  ({1,2},{0})  =  ({!}>  {o}),  /(00001)  = 

({0,1, 2},  {0,1,2}),  since  00001  does  not  appear  in  any 
row  of  F,  and  /(11012)  =  ({0},  {1, 2}),  from  Row  2. 

In  the  following,  we  briefly  give  the  definitions  and 
notation  used  in  connection  with  blankets;  for  more  de¬ 
tails  see  [2].  A  blanket  on  a  set  5  is  a  collection  /3  — 
{Bi , . . . ,  Bk  }  of  nonempty  and  distinct  subsets  of  5,  called 
blocks ,  whose  union  is  S.  We  write  /3  —  {Bi}  when 
it  is  possible  to  avoid  referring  to  the  number  of  blocks 
in  (3.  Also,  we  write  a  block  with  a  bar  over  it  to  re¬ 
duce  the  number  of  curly  brackets.  Thus,  for  example, 
/ 3  =  {{1, 2},  {2, 3}}  is  a  blanket  on  the  set  {1, 2, 3};  it  will 
be  written  {1,2, 2,3}. 

The  product  (3*f3'  of  two  blankets  is  defined  by 

/?*/?'  =  ne  {Bi  C\Bj  \  B{  £  (3  and  Bj  £  /?'}. 

where  ne  {5*}  =  {S*}  -  {0}  for  a  set  {5*}  of  subsets  of  S . 

For  two  blankets  f3  and  (3f  on  5,  we  write  /?</?'  if  for 
each  B{  in  /?  there  exists  a  Bj  in  /?'  such  that  Bi  C  Bj. 

Assume  the  rows  in  F  are  numbered  1, . . . ,  p.  We  deal 
with  blankets  on  the  set  of  rows  of  F,  but  refer  to  them  as 
blankets  on  the  set  {1, . . . ,  p},  for  convenience.  Suppose  V 
is  an  r-element  subset  of  the  set  X  U  Y  of  input  and  output 
variables.  Blanket  (3V  is  defined  by 

(3V  =  ne{Fv2d  \  d  £  £r}. 

In  particular,  (3X  is  the  input  blanket,  corresponding  to  the 
set  X  of  input  variables,  and  (3y  is  the  output  blanket,  cor¬ 
responding  to  Y. 

For  the  matrix  of  Table  1 ,  let  U  =  {xi ,  ,  x$  }  and  V  = 

{373 ,  x^ ,  x§  },  then 

/3u  =  {37576;  6;  5;  7;  8;  1^5;  1^7;  4;  2^}, 
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/ %  =  {1,3;  1;  6,7,8;  3;  5;  2;  6;  4;  7},  and 

py  =  {7;  2;  M;  M;  5;  3;  5}. 

3  Double  Separation  of  Set  Functions 

Let  X  =  {ari, . . . ,  xn}  be  the  set  of  input  variables  of 
a  set  function  /.  Let  U  and  V  be  two  subsets  of  X  such 
that  U  U  V  =  X .  Without  loss  of  generality,  we  rela¬ 
bel  the  variables  x± , . . . ,  xn  so  that  U  =  {xi , . . . ,  xr  }  and 
V  =  {xn-9+ x, . . .  Thus  for  a  vector  x  with  n  com¬ 

ponents,  the  vector  of  the  first  r  components  is  xu  and  the 
vector  of  the  last  s  components  is  xv .  We  now  define  the 
concept  of  separation,  which  is  like  decomposition,  but  for 
which  no  attention  is  paid  to  the  number  of  inputs  in  the 
component  functions.  In  a  decomposition,  the  component 
functions  should  have  fewer  inputs  than  the  function  being 
decomposed.  Since  there  is  a  natural  relationship  between 
blankets  and  separations,  we  consider  separations  first.  Size 
constraints  can  be  added  separately. 

Definition  1  Let  f  be  a  set  function  specified  by  set  matrix 
F  with  n  >  0  inputs  and  m  >  0  outputs ,  and  let  ( U ,  V) 
be  as  above .  Let  gu  be  a  set  function  with  r  inputs  and  p 
outputs ,  let  gv  be  a  set  function  with  s  inputs  and  q  outputs , 
and  let  hbe  a  set  function  with  p  +  q  inputs  and  m  outputs. 
The  triple  (gu  ,gv,h)  is  a  double  separation  off  with  respect 
to  (£/,  V),  if  for  every  minterm  b  £  £n  relevant  to  /,  we 
have  gu(bu)  £  £p,  gv(bv)  £  E9  and 

f(b)  D  h{gu{bu),gv(bv)). 

To  ensure  that  the  inputs  to  h  are  well  defined,  we  require 
that  gu(bu)  and  gv{bv)  be  well  defined,  i.e.,  that  they  should 
be  in  £p  and  £9  respectively.  Note  that  /  and  h  are  not 
equal,  since  some  don’t  cares  of  /  can  become  more  speci¬ 
fied  in  ft,  i.e.,  become  smaller  sets  of  values. 

Suppose  we  are  given  a  consistent  set  matrix  F(x)  with 
n  inputs,  a  set  V  C  X  of  cardinality  s ,  and  a  set  function 
g(v)  with  s  inputs  and  p  outputs.  Furthermore,  assume  that 
g(bv)  £  £p  for  all  minterms  b  £  £n  that  are  relevant  to  F. 
Let  e  £  £p,  and  let 

Fw=e  =  {t  £  F  I  3d  £  £*  :  tv  D  d  and  0(d)  =  e) 

be  the  set  of  all  rows  of  F  that  can  produce  the  value  e  for 
output  vector  w.  Note  that  Fw=e  is  always  a  union  of  blocks 
of  the  form  Fv3d-  Blanket  is  defined  by 

19  =ne{Fw=e}. 

Note  that  jg  is  uniquely  determined  by  F,  V,  and  g . 


Theorem  1  Let  f(x)  be  a  set  function  specified  by  a  set 
matrix  F,  and  let  (17,  V)  be  a  pair  of  subsets  ofX  satisfy¬ 
ing  U  U  V  —  X.  For  every  pair  of  blankets  09u  and  /39v 
satisfying  the  conditions 

•  Pu  ^  $9u>  fiv  —  fidv' 

•  figu  *Pgv  ^  Py> 

there  is  a  double  separation  (gu,  9v,  ft)  off  such  that 
y9u  <Pg»  ondygv  </39v. 

Intuitively,  the  first  condition  states  that  the  information 
present  in  the  inputs  in  vector  u  should  be  sufficient  to  com¬ 
pute  the  outputs  of  function  gu.  Similarly,  the  second  condi¬ 
tion  states  that  vector  v  should  be  sufficient  to  compute  the 
outputs  of  function  gv .  The  third  condition  requires  that  the 
information  contained  in  the  outputs  of  gu  and  gv  should  be 
sufficient  to  reproduce  output  y.  The  conditions  i9i  <  pg. 
can  be  interpreted  as  stating  that  function  gi  corresponds  to 
blanket  pg. .  Often,  y9i  =  p9i ,  but  the  inequality  permits  us 
to  use  more  blankets. 

The  separation  in  [2]  is  a  special  case  of  the  separation 
described  here.  The  proof  of  Theorem  1  is  constructive  [4], 
and  similar  to  the  corresponding  proof  in  [2].  The  converse 
of  Theorem  1  is  also  true,  provided  U  and  V  are  disjoint;  the 
proof  is  similar  to  the  one  in  [2].  Thus  the  counterexample 
for  the  converse  of  the  theorem  in  that  paper  also  serves  as 
a  counterexample  here,  when  U  and  V  are  not  disjoint. 

We  illustrate  the  construction  required  in  the  proof  of 
Theorem  1  using  the  function  of  Table  1,  u  =  (#i,  #2,  #3)* 
v  —  (x3,  x4,  £5)*  and  blankets 

/ 19u  =  {T737576;  T727r;  478},  and 

pgv  =  {T7§74;  6X8;  275}. 

We  can  check  that  these  blankets  satisfy  the  conditions 
of  Theorem  1.  Each  blanket  has  only  three  blocks;  thus 
we  can  code  the  blocks  of  each  blanket  by  a  single  ternary 
variable,  whose  value  is  shown  above  each  block.  Function 
gu  is  shown  in  Table  2  in  a  somewhat  compacted  form  to 
save  space.  Consider  (x1.x2.x3)  =  000.  Minterm  000 
appears  in  rows  3,  5,  and  6  of  F,  i.e.,  it  corresponds  to 
block  {3,5,6}  of  /?„.  Since  (3U  <  /?9„,  each  block  of 
/3U  is  contained  in  at  least  one  block  of  (3gu .  In  this  case 
{3, 5, 6}  C  {1, 3, 5, 6}.  Since  {1, 3, 5, 6}  has  been  assigned 
the  value  0,  we  set  gu(000)  =  0.  The  remaining  entries  are 
computed  similarly. 

Function  gv  is  computed  in  the  same  way;  it  is  shown  in 
Table  3. 

To  construct  function  h  =  h(wv,wv),  we  first  find  block 
B  of  P!)u  corresponding  to  input  wu  and  block  B'  of  /3g„ 
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Table  2.  Set  matrix  defining  gu 

Xi  X2  £3  wu 

I 0  0  0,1,2  0 

0  1  0, 2  1 

0  2  0  2 

1  0  0,2  0 

1  1  0,2  1 

2  0  1,2  2 

2  1  0,2  1 

2  2  1,2  2 


The  form  ft(ui, . . . , urig(vi, . . . ,  v*))  described  in  [2] 
is  called  a  single  separation .  It  is  a  special  case  of  the  sep¬ 
aration  above,  where  function  gu  is  the  identity  function. 
Double  separation  is  equivalent  to  two  single  separations 
performed  in  sequence.  The  verification  of  the  following 
proposition  is  routine  [4]. 

Proposition  1  Let  f  be  a  set  function  and  let  (gu,9v,h)  be 
a  double  separation  off .  Let  r,  q  and  m  be  as  before.  Then 
there  exists  a  set  function  h '  with  r+q  inputs  and  m  outputs 
such  that  (gVl  hf)  is  a  single  separation  off  and  (gu,h)  is 
a  single  separation  ofh '. 


corresponding  to  input  wv.  Next  we  calculate  the  inter¬ 
section  B  =  B  n  Bf.  The  output  is  then  the  intersec¬ 
tion  of  the  outputs  of  the  rows  contained  in  B.  If  B  n  Bf 
is  nonempty  then  it  is  contained  in  a  block  of  (3y,  since 
fig*  *Pgv  <  Py  Thus,  the  intersection  of  the  output  vectors 
is  also  nonempty.  For  example,  for  wuwv  =  00,  the  block 
of  /3gu  is  B  =  1, 3, 5, 6  and  the  block  of  f39v  is  Bf  =  1, 3, 4. 
The  intersection  of  the  two  blocks  is  B  =  1,3.  Hence  the 
output  is  ({l},{0})n  ({1, 2},  {0})  =  ({1},  {0}).  The  rest 
of  the  table  for  h  follows  similarly. 

Table  3.  Set  matrix  defining  gv 

£3  Xi  £5  wv 

I 0  6  M  0~ 

0  0  2  1 

0  10  0 

0  1  1,2  2 

0  2  2  2 

1,2  0  2  1 

1,2  2  2  0 

2  11  2 


The  table  of  h  for  our  example  is  shown  in  Table  4. 


Table  4.  Set  matrix  defining  h 


The  reader  can  verify  Proposition  1  for  our  running  ex¬ 
ample  by  constructing  a  function  h!  for  a  single  separation. 

4  Table  Reduction  Method 

We  now  describe  Shestakov’s  method  [9]  for  finding 
blankets  (3Su  and  (39v .  The  method  can  be  used  for  both 
single  and  double  separations.  We  first  derive  a  single  sepa¬ 
ration  and  then  a  double  one.  Note  that  our  single  separation 
is  of  the  form  h{gu(u),v),  so  the  usual  roles  of  U  andV  are 
reversed. 

For  simplicity,  we  use  a  binary  function  as  an  example. 
Let  /  be  a  Boolean  function  with  five  inputs  and  two  outputs 
specified  by  the  matrix  in  Table  5.  Note  that  $  is  a  short 
hand  for  {0, 1}  and  curly  brackets  are  omitted  for  all  other 
entries.  This  example  is  taken  from  [9]. 


Table  5.  Matrix  defining  f. 

Row  £X  x2  £3  Xj  £5  yi  y2 


1  0 

2  0 

3  1 

4  $ 

5  1 

6  0 

7  0 

8  1 


0  1 
1  1 
$  0 
1  $ 
1  1 
$  0 
0  1 
$  0 


0  1 

$  $ 

0  1 

1  0 

0  # 

$  1 

1  1 

1  1 


0  1 
1  1 
$  0 
$  1 
0  1 
0  0 
1  0 
$  1 


Wu  Wv  Z\  Z2 

"0  0  I  0~ 

0  1  11 

0  2  2  1,2 

10  10 

11  0  0 

12  0  1,2 

2  0  0, 1  2 

2  1  12 


Let  U  =  {£1, £2,  £3}  and  V  =  {£3,  £4,  £5}.  We  calcu¬ 
late  (3U  and  /3„  and  label  their  blocks  by  the  integers  corre¬ 
sponding  to  the  binary  input  combinations.  Hence,  for  ex¬ 
ample,  block  B5  is  missing  from  /?„  as  (£1 ,  £2,  £3)  =  101 
is  not  present  in  the  matrix  defining  /. 

Bo  B\  B2  B3  B4  Be  B7 

pu  =  {  6  ;  1, 7;  4, 6;  2, 4;  3, 8;  3, 4, 8;  4^}, 

Eft  Tjt  rjt  ryt  n/  T}/ 

1  "2  "3  U 6  a7 

Pv  =  {3, 6;  4;  6, 8;  2, 5;  1,2,5;  2,4;  2,7}. 
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We  construct  a  block  multiplication  table  where  rows  are 
labeled  Bi  and  columns  S'.  To  define  the  entries  in  the 
table,  we  need  the  following  definitions. 

Definition  2  Let  W  be  a  subset  of  X  with  k  elements .  Let 
B  be  a  block  in  blanket  (3W.  Then  beXk  is  a  characteristic 
minterm  ofB  ifB  is  the  set  of  all  rows  of  F  that  involve  b, 
i.e.,  ifB  =  Fwob- 

For  example,  if  W  —  U  for  the  function  of  Table  5,  then 
100  is  a  characteristic  minterm  of  B±,  but  110  is  not,  since 
Fu2ioo  =  {3,8}  =  £4  andFUDiio  =  {3,4,8}  =  B6. 

Definition  3  Given  a  block  Bi  of  (3U  and  a  block  Bj  of  / 3V , 
we  define  Bi  A  £'  =  B{  0  B\  if  Bi  fl  B\ \±  0  and  there  exists 
a  minterm  b  relevant  to  f  such  that  bu  is  a  characteristic 
minterm  of  Bi  and  bv  is  a  characteristic  minterm  of  Bj. 
Otherwise ,  Bi  A  Bj  =  0. 

Intuitively,  the  operation  A  is  intersection  if  there  is  a 
common  minterm  which  defines  both  blocks,  and  is  empty 
otherwise.  In  fact,  we  can  show  that  in  the  case  U  D  V  =  0, 
this  operation  is  equivalent  to  block  intersection.  Entry  Tij 
in  table  T  is  now  defined  to  be  Bi  A  Bj.  The  block  multi¬ 
plication  table  for  our  function  is  presented  in  Table  6.  For 
clarity,  empty  sets  are  represented  by  —  and  curly  brackets 
are  omitted  for  all  other  entries. 

Table  6.  Block  multiplication  table 


Note  that  T2j6  is  empty  instead  of  B2  fl  B'6  =  {4}.  This 
is  because  all  characteristic  minterms  of  B2  satisfy  x2  =  0, 
while  all  characteristic  minterms  of  B$  satisfy  x2  =  1;  thus 
minterm  b  of  our  definition  does  not  exist,  and  B2  A  Bf6  = 
0.  Now  we  construct  the  output  table  by  replacing  Tij  by 
riteXi  ^  ?i}j  is  not  empty,  and  by  the  complete  don’t 
care,  otherwise.  We  use  —  to  denote  the  complete  don’t 
care.  The  output  table  for  our  example  is  shown  in  Table  7. 

Definition  4  A  subset  S  =  {Bix , . . . ,  Bik }  of  blocks  of  / 3U 
is  called  compatible  if  in  each  column  of  the  output  table , 
the  entries  in  rows  *1 , . . . ,  ijt  have  a  nonempty  intersection. 

We  now  find  a  blanket  0b  i  on  the  set  0U\  the  blocks  of 
0Bi  are  compatible  sets  of  blocks  of  0U.  We  obtain  a  blan¬ 
ket  0  on  F  by  replacing  each  block  with  the  union  of  its 


elements.  We  then  use  blanket  (3  as  03u  to  obtain  a  single 
separation.  Note  that  this  is  equivalent  to  Luba  and  Sel- 
varaj’s  method  [6],  where  blocks  are  merged  in  0U  to  get  0 
such  that  0V*0  <0y  For  some  graph  theoretical  methods 
for  finding  0  see  [7]. 

In  our  example,  one  possible  blanket  0b,  on  the  set  0U  is 
0B.  =  {B0,B1,B2,Br,  B3)B4,B6}.  The  corresponding 
blanket  03u  with  one  possible  encoding  is 

0gu  =  {1,4,5,6,75  OT}. 

We  can  check  that  (3Qu  */3v  <  /3y,  since 

Py  =  {376;  1,4, 5, 7;  3^;  2^8}. 

Thus  (39u  corresponds  to  a  single  separation  of  / . 

Denote  the  block  of  f39u  encoded  0  by  So  and  the  block 
encoded  1  by  Si.  We  construct  the  row-merged  output  table 
as  follows.  The  rows  of  the  table  are  labeled  Si  and  the 
columns,  Bj.  The  entry  (l;j)  of  the  table  is  the  intersection 
of  entries  (ik,j)  in  the  row  compatibility  table  where  Si  = 
{Bik }.  This  table  is  shown  in  Table  8. 


Table  8.  Row-merged  output  table 


B'2 

B'z 

B'a 

£5 

B'e 

B\ 

So 

00 

$1 

00 

01 

01 

$1 

10 

Si 

$0 

$1 

$1 

11 

11 

11 

11 

Now  we  use  a  similar  procedure  to  find  compatible 
columns.  From  our  table,  we  obtain  the  following  blan¬ 
ket  on  the  set  (3V:  /?#'  =  {B[\  B*2,  B±,  B*h,  B§\  B’z\  B'7}. 

The  corresponding  blanket  / 39v  with  one  possible  encoding 
00  01  10  11 
is/3flv  ={3,6;  1,2, 4, 5;  6,8;  2,7}. 

We  label  the  four  blocks  <%,..., £3.  After  merging 
columns  we  obtain  Table  9.  We  can  get  function  h  directly 
from  this  table:  each  input  combination  of  h  is  formed  by 
concatenating  the  encodings  of  Si  and  Sj,  while  the  output 
vector  is  the  entry  (i,  j)  of  the  table. 
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Table  9.  Final  Table 


S'o 

S'l 

s'2 

S'z 

So 

00 

01 

00 

10 

Si 

$0 

11 

$1 

11 

Note  that  we  could  have  merged  columns  before  rows  in 
Table  7.  This  is  equivalent  to  interchanging  sets  U  and  V.  If 
we  do  this,  we  get  ]3B>  =  {B£ ,  B' ,  B' ,  B£;  B'2,  B’r;  B' }, 

which  results  in  /39v  =  {1,2,3,4,576;  2,4,7;  678}. 

We  then  find  (3Bi  =  {B0,B2;  Bi,B7;  B3;  B4,B6}, 
and/?5u  =  {4,6;  1,4, 5, 7;  2,4;  3,4,8}. 

Table  10.  New  Final  Table 


S'o 

<*1 

S'2 

So 

00 

$1 

00 

Si 

01 

10 

— 

62 

11 

11 

— 

S3 

$0 

$1 

$1 

The  new  final  table  is  shown  in  Table  10.  Note  that  this  is 
a  4  x  3  table  as  opposed  to  the  2  x  4  table  we  obtained  earlier. 
Thus  the  order  in  which  we  reduce  our  table  is  important. 
In  our  example,  the  set  of  tables  for  our  second  double  sep¬ 
aration  would  have  larger  total  size  than  the  tables  for  our 
first  double  separation. 

Theorem  2  The  table  reduction  method  described  above 
results  in  a  double  separation  of  the  original  function  f. 

Proof.  Let  6  be  a  minterm  relevant  to  /.  Then  bu  is  a  char¬ 
acteristic  minterm  of  a  block  B{  in  blanket  (3U  and  bv  is 
a  characteristic  minterm  of  a  block  B}  in  blanket  (3V .  We 
know  that  Bi  D  B'  ^  0,  as  the  row  that  contains  b  is  in  the 
intersection.  By  definition,  we  have  Bi  A  B}  =  B*  D  B' .  In 
our  final  table  obtained  using  the  procedure,  there  is  an  en¬ 
try  corresponding  to  the  value  h  ( gu  ( bu ) ,  gv  (bv ) ) .  This  entry 
is  the  intersection  of  some  output  vectors,  one  of  which  is 
the  entry  Tij  in  the  output  table.  Since  that  entry  is  simply 
/(6),  we  get  f(b)  D  h(gu(bu),gv(bv))  as  required. 

5  Conclusions 

We  have  formalized  Shestakov’s  decomposition  method 
using  blanket  algebra,  we  have  generalized  it  to  multi¬ 
valued  functions  represented  by  cubes,  we  have  related  it 
to  the  work  of  Luba  and  Selvaraj,  and  we  have  proved  its 
correctness. 

We  close  by  mentioning  some  related  issues.  One  prob¬ 
lem  that  we  have  not  discussed  so  far  is  the  selection  of  the 


sets  U  and  V  for  a  decomposition.  This  can  be  done  by  trial 
and  error;  one  can  also  use  certain  necessary  conditions  on 
blankets  f3u  and  f3y  to  reduce  the  search.  We  refer  the  reader 
to  [2]  for  more  details. 

Further  theoretical  issues  related  to  blanket  algebra  and 
its  application  to  function  decomposition  are  discussed 
in  [1,  2].  Blanket  methods  for  decomposition  have  been  im¬ 
plemented  by  Nowicka  and  Luba  in  a  package  called  DE- 
MAIN.  Implementation  issues,  experimental  results  and 
comparisons  with  other  decomposition  methods  are  dis¬ 
cussed  in  [2,  7]. 

Acknowledgment:  This  research  was  supported  by  the  Natural 
Sciences  and  Engineering  Research  Council  of  Canada  under  a 
postgraduate  scholarship  and  grant  No.  OGP0000871.  This  work 
was  done  while  the  first  author  was  at  the  University  of  Waterloo. 

Note:  References  [1,  2,  3,  4,  7]  can  be  obtained  from  the  web  at 
http : / /maveric . uwaterloo . ca/publication . html 

References 

[1]  J.  A.  Brzozowski  and  J.  J.  Lou,  Blanket  Algebra  for  Mul¬ 
tiple-Valued  Function  Decomposition,  Algebraic  Engineer¬ 
ing,  Proc.  Workshop  on  Formal  Languages  and  Comp.  Sys¬ 
tems,  Kyoto,  Japan,  24-28  March  1997,  C.  L.  Nehaniv  and 
M.  Ito,  eds.,  World  Scientific,  1998. 

[2]  J.  A.  Brzozowski  and  T.  Luba,  Decomposition  of  Boolean 
Functions  Specified  by  Cubes,  Part  I:  Theory  of  Serial  De¬ 
compositions  Using  Blankets,  Research  Report  CS-97-01, 
Dept,  of  Comp.  Sci.,  University  of  Waterloo,  Waterloo,  ON, 
Canada,  Jan.  1997;  REVISED  October  1998. 

[3]  J.  J.  Lou,  Decompositions  of  Multi-Valued  Functions,  Mas¬ 
ter  of  Math.  Thesis,  Dept,  of  Comp.  Sci.,  University  of  Wa¬ 
terloo,  Waterloo,  ON,  Canada,  August  1998. 

[4]  J.  J.  Lou  and  J.  A.  Brzozowski,  A  Formalization  of  Shes¬ 
takov's  Decomposition,  Research  Report  CS-98-03,  Dept,  of 
Comp.  Sci.,  University  of  Waterloo,  Waterloo,  ON,  Canada, 
February  1998. 

[5]  T.  Luba,  Decomposition  of  Multiple- Valued  Functions,  Proc. 
25th  International  Symposium  on  Multiple-Valued  Logic, 
Bloomington,  Indiana,  pp.  256-261,  May  1995. 

[6]  T.  Luba  and  H.  Selvaraj,  A  General  Approach  to  Boolean 
Function  Decomposition  and  its  Applications  in  FPGA- 
Based  Synthesis,  VLSI  Design,  Vol.  3,  Nos.  3-4,  pp.  289- 
300,  1995. 

[7]  M.  Nowicka  and  T.  Luba,  Decomposition  of  Boolean  Func¬ 
tions  Specified  by  Cubes ,  Part  II:  Practical  Results,  Research 
Report  CS-97-01  REVISED,  Dept,  of  Comp.  Sci.,  Univer¬ 
sity  of  Waterloo,  Waterloo,  ON,  Canada,  October  1998. 

[8]  E.  Shestakov,  Decomposition  of  Systems  of  Completely  De¬ 
fined  Boolean  Functions  by  Argument  Covering,  Automatic 
Control  and  Comp.  Sci.,  Vol.  28,  No.  1,  pp.  12-20,  1994. 

[9]  E.  Shestakov,  Decomposition  of  Systems  of  Incompletely 
Defined  Boolean  Functions  by  Argument  Cover,  Automatic 
Control  and  Comp.  Sci.,  Vol.  28,  No.  6,  pp.  4-15,  1994. 


71 


Gigantic  Pairs  of  Minimal  Clones 

Ivo  G.  Rosenberg*  and  Hajime  Machida! 


*  Department  de  mathematiques  et  de  statistique, 
Universite  de  Montreal,  Montreal,  Quebec  H3C  3J7  Canada 

f  Department  of  Mathematics, 

Hitotsubashi  University,  Kunitachi,  Tokyo  186-8601  Japan 


Abstract 

In  1992,  L.  Szabo  asked  for  the  minimal  number 
n  =  n(|j4|)  such  that  the  clone  of  all  operations 
on  A  can  be  generated  as  the  join  of  n  minimal 
clones.  He  showed,  e.g.,  n(p)  =  2  for  any  prime 
p,  and  later  G.  Czedli  proved  in  1998  that  if  k 
has  a  divisor  >  5  then  n(k )  =  2. 

In  this  paper,  a  pair  (/,  g)  of  operations  is 
called  gigantic  if  each  of  f  and  g  generates  a 
minimal  clone  and  the  set  {/,<?}  generates  the 
clone  of  all  operations .  First,  we  give  a  general 
theorem  to  characterize  a  gigantic  pair.  Then 
we  show  that  n(k)  —  2  for  every  k  which  is  not 
a  power  of  2. 

1  Introduction 

Let  A  be  a  finite  set  with  k(>  1)  elements. 
Let  O ^  be  the  set  of  all  n-ary  operations 

from  An  into  A  and  let  Oa  =  iXLi^T^* 
Denote  by  Ja  be  the  set  of  all  projections 
pr?  (1  <  i  <  n)  over  A  where  pr”  is  de¬ 
fined  as  pr?{xi , . . . ,  Xi, ... ,  *„)  =  x{  for  every 
(xi, . . .  ,xn)  in  An.  Often,  the  index  A  is  omit¬ 
ted  from  Of  or  Oa. 

A  subset  C  of  0A  is  a  clone  on  A  if  (i)  C  con¬ 
tains  JA  and  (ii)  C  is  closed  under  (functional) 
composition.  The  set  of  all  clones  on  A  is  a  lat¬ 
tice  with  respect  to  the  inclusion  relation.  It  is 
called  the  lattice  of  clones  on  A  and  is  denoted 
by  Ca- 

In  the  lattice  an  atom  of  C  A  is  called 
a  minimal  clone.  In  other  words,  a  clone  C  on 
A  is  a  minimal  clone  if  (i)  C  ^  Ja  ancl  (ii) 
JaCC'  CC  implies  C"  =  C  for  any  clone  C' 
on  A.  Dually,  a  coatom  of  £  a  is  called  a  maxi¬ 


mal  clone. 

In  contrast  to  maximal  clones  which  have 
been  completely  determined  in  terms  of  rela¬ 
tions  ([Ro  65],  [Ro  70A]),  minimal  clones  are 
not  yet  fully  known  in  spite  of  some  extensive 
efforts.  The  known  facts  include,  among  others, 
the  complete  list  of  minimal  clones  on  a  three- 
element  set  ([Cs  83])  and  the  type  classification 
of  minimal  clones  on  arbitrary  A  ([Ro86]).  For 
other  informations  on  minimal  clones,  refer  to 
[Pa  86],  [Qu  95],  [Szc  96],  etc.  In  relation  to 
minimal  clones,  the  authors  [MR  93]  studied  es¬ 
sentially  minimal  clones  which,  by  definition,  sit 
at  the  bottom  of  £  a  excluding  clones  generated 
by  unary  operations. 

L.  Szabo  raised  the  question  in  [Sza  92]:  De¬ 
termine  the  minimal  number  n  =  n(k)  such  that 
the  clone  of  all  operations  on  A  can  be  gener¬ 
ated  as  the  join  of  n  minimal  clones.  He  as¬ 
serted  that  2  <  n(k)  <  3  for  every  k  >  1  and 
showed  n(jp)  —  2  for  any  prime  p.  Later,  G. 
Czedli  [Cz  98]  proved  that  if  k  has  a  divisor  >  5 
then  n(fc)  =  2. 

In  this  paper,  after  defining  the  term  gigantic 
for  a  pair  (/,  g)  of  operations  in  0A  (Definition 
1.2),  we  give  a  theorem  to  characterize  a  gigan¬ 
tic  pair  (Section  2)  and  prove  the  existence  of  a 
gigantic  pair  for  every  k  that  is  not  a  power  of 
2  by  explicitly  constructing  an  example  of  such 
pair  (Section  3). 

Now  it  is  obvious  that  a  minimal  clone  can 
be  generated  by  a  single  operation. 

Definition  1. 1  An  operation  f  on  A  is  min¬ 
imal  if  (i)  it  generates  a  minimal  clone  and  (ii) 
every  operation  from  [/]  whose  arity  is  smaller 
than  the  arity  of  f  is  a  projection.  (Here,  [/]  is 
the  clone  generated  by  f .) 
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The  types  of  minimal  operations  are  known 
in  [Ro  86].  An  operation  f  £  O  is  idempotent 
if  /(x, . . . ,  x)  «  x.  (Here,  /(x, . . . ,  x)  «  x  means 
/(x, . . . ,  x)  =  x  for  all  x  £  A.) 

Theorem  1- 1  ([Ro  86])  Every  minimal  oper¬ 
ation  is  of  one  of  the  following  five  types: 

1)  Unary  operations  f  £  O M  such  that  ei¬ 
ther  (i)  /2(=  /  o  /)  =  /  or  (ii)  f  is  a  permuta¬ 
tion  of  prime  order  p. 

2)  Idempotent  binary  operations;  i.e.,  /  £ 
O W  such  that  /(x,x)  «  x. 

3)  Majority  operations ;  i.e.,  /  £  such 
that  f(x,x,y)  «  f(x,y,x)  «  f(y,x,x)  «  x  . 

4)  Semiprojections  (or  quasiprojections);  i.e., 
f  £  qM  (3  <  n  <  k)  such  that  there  exists 
i  (1  <  i  <  n)  satisfying  /(oi, . . . ,  an)  =  a*  when¬ 
ever  a\ , . . . ,  an  £  A  are  not  pairwise  distinct 

5)  If  k  —  2m,  the  ternary  operations 
f(x,y,z)  «  x  +  y  +  z  where  <  A  ;  +  >  is  an 
elementary  2-group  (i.e.,  the  additive  group  of 
an  m-dimensional  vector  space  over  GF( 2 ) ). 

Inspired  by  [Cz  98],  we  give  the  following  def¬ 
inition. 

Definition  1.  2  A  pair  ( f,g )  of  minimal  oper¬ 
ations  is  called  gigantic  if  {/,  g)  is  complete, 
(i.e.,  [/, g]  —  O  or,  equivalently ,  <  A  \f,g  >  is 
primal.) 

Before  we  proceed,  we  shall  review  some  ter¬ 
minology  from  universal  algebra. 

Definition  1.3  A  =  <  A;  F  >  is  an  alge¬ 
bra  if  A  is  a  set  and  F  is  a  set  of  finitary 
operations  defined  over  and  taking  values  in  A. 
When  F  —  {/i,...,/t},  A  is  also  expressed  as 
<  A;  /i,...,/t  >. 

Let  A  =<  A;  F  >  be  an  algebra. 

A  subset  B  of  A  is  a  subuniverse  of  A  if 
B  is  closed  under  every  operation  f  in  F.  A 
subuniverse  B  of  <  A;  F  >  is  a  proper  subuni¬ 
verse  if  </)  C  B  C  A. 

An  equivalence  relation  0  on  A  is  a  congru¬ 
ence  of 

A  if  6  satisfies  the  property  that  for  every  op¬ 
eration  f  in  F,  if  f  is  n-ary,  xi0yi,...,xn0yn 
implies  f(x  i , . . . ,  xn)  0f\yx , . . . ,  yn)  for 
all  xx,...,xn,yx,...,yn  £  A.  A  congruence  6 
of  A  is  proper  if  0  is  not  a  trivial  equivalence 
relation.  An  algebra  A  is  simple  if  it  has  no 
proper  congruences. 

Moreover,  a  permutation  (p  on  A  is  an  auto¬ 
morphism  of  A  if  (p  satisfies  the  property  that 


for  every  operation  f  in  F,  if  f  is  n-ary , 

/(^(xi), . . . ,  ip(x„))  =  <p(f(x  1,...,  x„)) 

for  all  xx, . . . ,  xn  £  A.  The  set  of  all  automor¬ 
phisms  of  A  is  denoted  by  Aut  A.  An  automor¬ 
phism  (p  is  proper  if  ip  is  not  the  identity  per¬ 
mutation  id  a  of  A. 

Finally  in  this  section,  we  supply  some  defi¬ 
nitions  concerning  relations. 

Definition  1. 4  For  a  set  A,  an  h-ary  rela¬ 
tion  on  A  is  a  subset  of  the  Cartesian  product 
Ah .  For  an  n-ary  operation  f  in  Oa  and  an  h - 
ary  relation  p  on  A,  f  is  said  to  preserve  p  if 
(xij,x2j1...7xhj)  £  p  for  every  j  =  1,2, ...,n 
implies 

(/ (^ll  J  ^12 >  •  •  •  >  ^ln)j  . .  .  ,  f (x/ji , .  .  •  ,  Xhn))  £  P  • 

Definition  1.  5  Let  k  —  hm  where  h  >  2  and 
m  >  1.  A  set  T  =  {0X ,  ...,0m}  of  equiva¬ 
lence  relations  on  k  is  regular,  or  T  is  a  regu¬ 
lar  system  on  k,  if  ( i)  each  block  (equivalence 
class)  of  every  0*  has  h  elements  and  (ii)  if  Bi 
is  a  block  of  0,  for  all  i  =  1, . . . ,  m  then 

|  Bi  n  —  n  |  >  1 . 

Next,  the  relation  determined  by  T  is  the  rela¬ 
tion  A t  defined  as  the  set  of  all  (ai , . . . ,  a^)  £  kh 
such  that  for  each  i  =  1  it  holds  that 

ar0ia8  for  some  1  <  r  <  s  <  h. 

2  Characterization 
Theorem 

In  this  section  we  characterize  a  gigantic  pair 
and  for  this  purpose  we  need  the  following  def¬ 
initions.  Hereafter  A  is  assumed  to  be  the  set 
fc  =  {0,l,...,*-l}  (Jb>l). 

Definition  2. 1  For  a  divisor  p  of  k  denote  by 
Fp  the  set  of  all  permutations  of  k  with  l  \—kjp 
cycles  of  length  p. 

Let  f  £  Fp  have  cycles  Co,..., Ct~x . 

An  equivalence  relation  0  on  k  is  transver¬ 
sal  to  f  if  there  exist  1)  an  equivalence  relation 
A  on  l  distinct  from  the  least  equivalence  rela¬ 
tion  on  £  and  2)  an  element  c*  £  Ci  for  each 
i  £  t  such  that 

0={(frn(d)Jm(cj))\  iXj,  0  <  m  < p  —  1}  . 

A  permutation  ip  of  k  is  orthogonal  to  f 
if  1)  ip  £  Fq  for  some  prime  divisor  q  of  k,  2) 
/  o ip  =  ip  o  /  and  3)  if  q^p  then  each  cycle  of 
ip  meets  every  Ci  in  at  most  a  singleton. 
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Now,  we  shall  characterize  gigantic  pairs. 

Theorem  2. 1  Let  f  €  0(m)  and  g  €  0(n) 
w/iere  m  <  n  and  let  A  :=<  fc ;  </  >.  Then  the 
pair  (/,  <?)  is  gigantic  if  and  only  if  the  following 
conditions  (i)  -  (vii)  are  satisfied. 

(i)  m  —  1  and  f  £  Fp  for  some  prime  divisor  p 

of  k  ( with  cycles  Co, ... ,  C/_i  ). 

(ii)  n  >  1  and  g  is  minimal 

(iii)  Cij  U  •  *  •  U  Cih  is  not  a  proper  subuniverse 
of  A  for  any  0  <  i\  <  •  •  •  <  %h  <  t  —  1- 

(iv)  No  congruence  of  A  is  transversal  to  f . 

(v)  No  automorphism  of  A  is  orthogonal  to  f . 

(vi)  Let  k  —  hm  where  h  >  2  and  m  >  1.  If 
there  exist:  a)  a  permutation  ip  of  m  of  or¬ 
der  1  or  p  and  permutations  f0, . . . ,  fm-i  of 
h  such  that 

a)  /*  /<*!•••  /°r  eac/l  cycle 

(do  .  ,.dp-i)  of  if>, 

f3 )  for  every  fixed  point  i  of  ip  the  permu¬ 
tation  fi  is  of  order  1  or  p  and 

7 )  for  some  fixed  point  j  of  tp  the  permu¬ 
tation  fj  is  fixed-point-free  and  of  order  p 
and  b)  a  bijection 

V>:x.— >x  =  (x<°\...,x<'"-1>) 

of  k  onto  h  m  such  that  for  all  x  6  k 

m  =  (/o(*)WO),,...,/m-l(x)Wm-l)))  , 

(1) 

then  g  does  not  preserve  the  h-ary  relation 

{(oi, . . . ,  an)  €  kh  I  •  •  • ,«{?}  <  h-l 

for  every  i  —  0, . . . ,  m  —  1}. 

(vii)  a)  Let  k  =  pm  for  a  prime  p  and  m  >  l, 
b)  let  x  i — >  x  be  a  bijection  from  k  onto 
pm  (the  latter  considered  as  the  set  of  all 
m  x  1  matrices  over  p )  and  c)  let  for  all 

x  £  k 

f(x)  =  Ax  +  B  (2) 

where  A  =  P~lJP,  B  =  P~lB'  with  P 
a  nonsingular  (over  GF(p))  m  x  m  ma¬ 
trix  over  p,  J  the  m  x  m  Jordan  matrix 
with  diagonal  (1, . . . ,  1)  and  blocks  of  sizes 
andB'  =  (6i,...,6m)  €  pm  is 

such  that 

(I  +  J  +  J2  - \-Jp~1)B'  =  Omxi,  (3) 

and  btl+...+ia  ±  0  for  some  1  <  u  <  h. 
Then  g  does  not  preserve  the  quaternary  re- 
lation 

{(x,y,z,t)  €  fc4  1 1  =  x  0y  ©  ?}.  (4) 


Proof  (=>)  Let  the  pair  (/,  g)  be  gigantic.  As  / 
and  g  are  minimal,  they  are  of  the  five  types  1)  - 
5)  listed  in  the  previous  section.  Notice  that  ev¬ 
ery  minimal  operation  h  of  type  2)  —  5)  is  idem- 
potent  and  so  {0}  is  a  subuniverse  of  <  fc  ;/i  >. 
Since  {f,g}  is  complete,  at  least  one  of  /  and 
g  is  of  the  first  type.  As  m  <  n,  clearly  /  is  of 
the  first  type  and,  consequently,  is  unary.  Now 
g  cannot  be  unary  due  to  the  completeness  of 
{f,g}.  Thus  n  >  1  and  g  is  idempotent.  We 
have  proved  (ii).  To  prove  (i),  suppose  to  the 
contrary  that  f2  =  /.  Choose  a  in  the  image  of 
/.  Thus  f(a)  =  a  and,  since  g(a,  ...,a)  =  a,{a} 
is  a  subuniverse  of  B  =  <  fc  ;  f,g  >  which  con¬ 
tradicts  to  the  completeness  of  {f,g}.  It  follows 
that  /  p(x)  «  x  for  some  prime  p.  Suppose  that 
/  has  a  fixed  point  a  e  fc.  Then,  again,  (a)  is  a 
subuniverse  of  B.  Thus  /  is  fixed-point-free  and 
f  e  Fp  ,  proving  (i).  Each  of  the  conditions  (iii) 
-  (vii)  stipulates  that  {f,g}  does  not  belong  to 
certain  maximal  clone. 

(<t=)  Let  {f,g}  satisfy  the  conditions  (i)  - 
(vii).  Clearly  the  permutation  /  is  surjective. 
By  (ii),  g  is  essentially  more  than  unary  and, 
being  idempotent,  it  is  also  surjective. 

For  a  set  B  of  surjective  operations  contain¬ 
ing  an  essentially  more  than  unary  operation, 
the  general  completeness  criterion  from  [Ro  65, 
Ro  70A]  was  simplified  in  [Ro  70B]  (Theorem  3) 
to  the  following  criterion: 

For  a  set  B  with  the  above  properties,  the 
set  B  is  complete  if  and  only  if 

(A)  <  fc  ;  B  >  is  simple  and  has  no  proper  sub- 
universe  and  no  proper  automorphism, 

(B)  If  fc  =  pm  for  a  prime  p  and  m  >  1  and  if 
<  fc ;  +  >  is  an  elementary  abelian  p-group  then 
some  be  B  does  not  preserve  the  quaternary  re¬ 
lation 

{( x,y,z,x -y  +  z )  |  x,y,z  €  fc}, 

and 

(C)  If  k  =  hm  with  h  >  2  and  m  >  1  and  T  is 
an  /i-regular  system  on  k  then  some  b  6  B  does 
not  preserve  A r- 

In  view  of  this  criterion,  it  suffices  to  ver¬ 
ify  the  conditions  (A),  (B),  and  (C)  for  B  = 
<k;f,g>. 

Here  we  shall  only  consider  the  condition  (A). 
Let  C:=<  k  ;  /  >  and  consider  a  subuniverse  S 
of  C.  If  a  cycle  Ci  of  /  meets  5  then  clearly 
Ci  C  5;  thus  the  subuniverses  of  C  are  exactly 
the  sets  Cix  U  •  •  •  U  Cih  with  0  <  ii  <  . . .  <  %h  < 
l-  1.  Now  the  condition  (iii)  guarantees  that 
B=<  k  \f,g>  has  no  proper  subuniverse. 
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Next  we  assume  to  the  contrary  that  there 
exists  a  proper  congruence  0  of  B=  <  fc  ;  /,  <7  >. 
Consider  a  block  B  of  0. 

Claim  1  B  is  either  of  the  form 

(5) 

for  some  0  <  i\  <  ...  <  %h  <£,  h  ^  £,  or 
| B  n  Ci\  <  1  for  all  0  <  i  <  L 

Proof.  Suppose  \B  fl  Ctj  >  1  for  some  0  < 
i  <  £.  Then  there  exist  a  E  Ci  and  0  <  m  <  p 
such  that  a0fm(a).  As  /  preserves  0 ,  clearly 
fTnt(a)0fTn^+1\a)  for  all  i  =  0, . . .  ,p  —  1  (where 
f°(a)  =  a).  Now  p  being  a  prime,  the  set 
{m0,  ml, . . . ,  m(p—  1)}  coincides  modulo  p  with 
p  and  therefore  Ci  C  B.  Now  suppose  that 
| B  fl  Cj\  =  1  for  some  0  <  j  <  £,  j  ^  i-  Then 
B  fl  Cj  =  { b }  for  some  6  €  fc.  Choose  a  £  Ci. 
Clearly  aOb  implies  f(a)0f(b)  where  f(a)  E  Ci 
and  f(b)  E  Cj.  Now  a0f(a ),  f(a)0f(b)  and 
a  E  Ci  C  B  show  f(b)  E  B  D  Cj  =  { b },  which  is 
a  contradiction  to  /  being  fixed-point-free.  This 
implies  that  B  is  of  the  form  (5)  and  the  claim 
follows. 

Claim  2  |£  D  Ci\  <  1  for  all  0  <  i  <  £. 

Proof.  Suppose  to  the  contrary  that  B  is  of 
the  form  (5).  Since  the  operation  g  also  pre¬ 
serves  0 ,  it  maps  Bn  into  a  block  Bf  of  0.  How¬ 
ever,  g  is  idempotent,  and  so,  for  every  b  E  B 
obviously  b  =  <?(&, . . . ,  b)  E  proving  B '  =  B. 
Thus  g  maps  Bn  into  B  and  so  B  is  a  proper 
subuniverse  of  B  =  <  fc  ;  /,  <7  >.  This  is  against 
the  assumption  (iii),  and  proves  the  claim. 

From  Claim  2  it  is  easy  to  see  that  0  is 
transversal  to  /,  in  contradiction  to  (iv).  Thus 
B  is  indeed  simple. 

Now  suppose  to  the  contrary  that  there  ex¬ 
ists  a  proper  automorphism  ip  of  B. 

Claim  3  Each  proper  automorphism  ip  of 
B  is  fixed-point-free. 

Proof.  It  is  easy  to  see  that  F  :=  {x  E 
fc  |  <p(x)  =  x)  is  a  subuniverse  of  B.  We  have 
shown  above  that  B  has  no  proper  subuniverse. 
As  ip  ^  idfo  clearly  F  C  fc  and  so  F  =  <p  ,  prov¬ 
ing  the  claim. 

Claim  4  If  ip  is  a  proper  automorphism  of 
B  then  ip  E  Fr  for  some  proper  divisor  r  of  fc. 

Proof.  Suppose  to  the  contrary  that  ip  has  a 
cycle  T  of  length  c  and  a  cycle  of  length  d  where 
c  <  d.  Then  <p  c  is  a  proper  automorphism  of  B 
whose  fixed  points  include  T.  This  contradicts 
Claim  3  and  proves  the  claim. 

Claim  5  Fq  fl  Aut  B  is  nonvoid  for  some 


prime  divisor  q  of  fc. 

Proof.  By  Claim  4,  p  E  Fr  for  some  proper 
divisor  r  of  fc.  Choose  a  prime  divisor  q  of  r  and 
set  u  :=  r/q.  Then  ip  :=  (pu  is  a  proper  auto¬ 
morphism  of  B  and  ip  E  Fq  ,  proving  the  claim. 

Claim  6  There  exists  an  automorphism  ip 
of  B  which  is  orthogonal  to  /. 

Proof.  By  Claim  5,  there  exists  ip  E  Fq  n 
AutB.  Clearly  ip  satisfies  the  conditions  1) 
and  2)  in  the  definition  of  orthogonality.  Sup¬ 
pose  that  ip(a)  =  fm(a)  for  some  a  E  fc  and 
0  <  m  <  p.  Then  {/m0(a), . . . ,  /m(p-1)(a)}  is 
the  cycle  Ci  containing  a  and  hence  q  =  p.  This 
proves  the  claim. 

We  have  shown  that  ip  is  orthogonal  to  /. 
However,  this  contradicts  (v). 

Thus  we  have  verified  that  the  condition  (A) 
holds  for  B.  Due  to  the  lack  of  space  we  shall 
omit  the  proof  for  the  conditions  (B)  and  (C).  □ 

The  conditions  (i)  -  (vii)  in  Theorem  2.1  sim¬ 
plify  if  fc  is  a  prime.  We  state  the  result  without 
proof. 

Corollary  2. 1  Let  k  be  a  prime ,  and  let  f  E 
O and  g  E  O ^  where  m  <  n  .  Let  A  := 
<  fc  ;  g  > .  Then  the  pair  (/,  g)  is  gigantic  if  and 
only  if 

(i)  m  =  1  and  f  is  a  cyclic  permutation  of  fc  , 

(ii)  n  >  1  and  g  is  minimal \ 

(v’)  /  is  not  an  automorphism  of  A  ,  and 

(vii’)  J/n  =  2,0<6<fc  and  ip  :  x  1 — >  x  is 
permutation  of  fc  such  that  for  all  x  E  fc 
f(x)  =  x06  then  for  no  a  E  fc  the  operation 

g'(x,y)  :ss 

is  of  the  form  ax  0  (1  —  a)y  . 

Example  1.  Let  fc  =  2  (Boolean  case).  There 
are  4  nonunary  minimal  operations,  namely,  V, 
A,  d  and  r,  where  V  and  A  are  the  disjunction 
(OR)  and  the  conjunction  (AND),  d(x,y>z)  « 
(x  A  y)  V  (y  A  z)  V  (z  A  x)  and  r(x ,  y,  z)  «  x  0  y  0 
z.  Notice  that  -ix  is  an  automorphism  of  d  and 
that  r  is  quasi-affine.  The  remaining  two  pairs 
(-1,  V)  and  (-i,  A)  both  satisfy  the  conditions  of 
Corollary  2.1.  This  proves  that  for  fc  =  2  there 
are  exactly  2  gigantic  pairs. 

Corollary  2.  2  [Sz  92]  A  gigantic  pair  eocists 
for  every  prime  fc. 


77 


Proof  In  Corollary  2.1,  set  f(x)  x  ©  1 
and  also  set  g{x,y)  for  all  x,y  G  k  such  that 
g{x,y)  =  x  if  x  =  y  and  g(x,y)  =  k  -  1  if 
x  ^  y.  It  is  easy  to  see  that  g  is  minimal; 
in  fact,  g  is  a  semilattice  join  whose  order  is 
0  <  Jfe  —  1, —  2  <  &  —  1.  Every  automor¬ 
phism  of  <  k ;  g  >  fixes  the  element  fc  —  1,  and 
this  proves  (v’)-  To  see  (vii’),  it  suffices  to  note 
that  an  operation  ax  ©  (1  —  a)y  takes  each  value 
from  k  exactly  k  times  which  is  not  true  for  g 
and  hence  also  not  true  for  g'.  □ 

3  The  existence  of  a  gigan¬ 
tic  pair  for  k  not  a  power 
of  2 

In  this  section,  we  show  that  a  gigantic  pair  ex¬ 
ists  for  any  k  that  is  not  a  power  of  2. 


Theorem  3, 1  If  k  is  not  a  power  of  2  then 
there  exists  a  gigantic  pair. 


Proof  Let  p  denote  the  greatest  prime  divisor 
of  k  and  let  l  :=  k/p.  Clearly  p  >  2  and  in  view 
of  Corollary  2.2  we  may  assume  that  I  >  1. 

First,  we  define  a  unary  operation  /  and  a 
binary  operation  g  on  k  and,  then,  prove  that 
the  pair  of  those  operations  is  in  fact  a  gigantic 
pair. 

We  define  /  by  setting 


f{ip+3 )  '■=  | 


ip  +  j  +  l 
ip 


if  i  €£,  0<j<p-l , 
if  i  €  t,  j  =  p  -  1 . 


Thus  the  cycle  Ci  of  /  is  (ip,  ip+ 1, . . . ,  ip+p- 1) 
for  all  i  €  £ ;  i.e.,  f  is  expressed  in  the  cyclic  no¬ 
tation  as 

/  =  (01  •••  p-l)(pp+l  •••  2p  —  1) •  •  • 
...  ((£  —  l)p  (l -  l)p  +  1  •••  k-  1). 

An  algebra  A  =  <fc;V>isa  semilattice  if 
the  binary  operator  V  is  associative,  commuta¬ 
tive  and  idempotent  (i.e.,  it  satisfies  x\/(yVz)  = 
(x  V  y)  V  z,  a;Vy  =  y  Vx  and  x  V  x  —  x 
for  all  €  k).  The  binary  relation  <  on  k 
corresponding  to  A  is  defined  by  setting  a  <  b 
whenever  aVb  =  b.  It  is  well-known  that  <  is  a 
(partial)  order  in  which  a  V  b  is  the  join  of  a  and 
b .  Conversely,  an  order  <  on  k  in  which  each 
pair  has  a  join  determines  a  semilattice. 

Let  g  be  the  following  semilattice  operator  V 


on  k.  For  every  x,  y  in  k  set 


xVy  =  < 


x 

(i  +  1  )p 


if  x  =  y , 

if  x  =  ip  +j,  y  =  ip  +j ' 
(0<i<£-2, 

1<J,  ]'  <P~1, 
otherwise . 


The  Hasse  diagram  of  the  order  corresponding 
to  the  semilattice  <  k ;  V  >  is  in  Figure  1.  No¬ 
tice  that  it  is  a  tree. 


We  verify  the  conditions  (i)  -  (vii)  of  Theo¬ 
rem  2.1. 

I)  The  condition  (i)  clearly  holds. 

II)  It  is  well-known  that  a  semilattice  join 
is  a  minimal  operation;  indeed,  it  generates  ex¬ 
actly  one  essentially  n-ary  operation,  namely, 
x\  V  •  •  •  V  xn  and  from  it  x  V  y  is  obtained  as 
xVyV**‘Vy. 

III)  Let  0  <  ii  <  . . .  <  ih  <  l  -  1  be 

such  that  C  =  U  •  •  •  U  cih  is  a  subuniverse 
of  A=<  k ;  V  >.  Since  p  >  3,  it  is  clear  that 
ijp  +  1,  ijp  +  2  €  Ci-  for  each  1  <  j  <  h. 
Consequently,  (ij  +  1  )p  —  ( ijp  +  1)  V  (ijp  +  2) 
belongs  to  C  and  therefore  ij+i  =  ij  +  1.  It  fol¬ 
lows  that  {ii,..., ih}  =  +  1}*  By 

the  same  reason  we  have  0  G  C,  proving  i\  =  0. 
Thus  C  =  k  and  (iii)  holds. 

IV)  First  we  remark  that  each  block  B  of  a 
congruence  0  of  a  semilattice  A  is  a  subsemi¬ 
lattice  of  A.  Now,  suppose  to  the  contrary  that 
there  exists  a  congruence  0  of  the  semilattice 
A  transversal  to  /.  Let  A  be  the  correspond¬ 
ing  equivalence  on  i  and  let  {£o, .  -  -  ,ifc- 1}  be  a 
nonsingleton  block  of  A  (where  0  <  io  <  . . .  < 
ih- 1  <  I  —  1).  We  distinguish  two  cases.  A) 
Suppose  io  >  0.  Consider  the  block  B  of  0  con¬ 
taining  io p.  Clearly  B  fl  =  {i\p  +  ci}.  Here 
(iop)  V  (iip  +  Ci)  =  0  £  B  due  to  io  >  0  which 
contradicts  to  the  above  remark.  B)  Suppose 
i0  =  0.  Consider  the  block  B  of  0  containing 
1  and  let  B  H  Cix  =  {iip  +  cx}.  If  c\  >  0 
then,  again,  1  V  (iip  +  C\)  =  0  £  B  and  we 
are  done.  Thus  let  c\  =  0.  Consider  the  block 
B '  of  0  containing  2.  Then,  by  the  definition, 
J B'nCij  =  {iiP+1}  and  2  V  (iip+1)  =  0  g  B'. 
This  contradiction  proves  the  claim. 

V)  It  is  obvious  that  every  semilattice  auto¬ 
morphism  fixes  its  greatest  element.  Since  every 
permutation  of  k  orthogonal  to  /  is  fixed-point- 
free,  it  is  clear  that  no  automorphism  of  A  is 
orthogonal  to  /. 

Here  again  we  omit  the  proof  for  VI)  and  VII) 
because  of  the  lack  of  space. 
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0 


ip- 1 


1  2  p- 1  p+1  p+2  2p— 1  (t—2)p+\  (£-l)p-l 


Figure  1:  The  semilattice  for  the  operation  V 


Thus,  the  pair  (/,V)  is  proved  to  satisfy  (i) 
-  (vii)  and  so  it  is  gigantic  by  Theorem  2.1.  □ 

4  Conclusion 

We  defined  a  "gigantic  pair”  as  a  pair  of  minimal 
operations  which  together  generate  the  clone  of 
all  operations  and  then  gave  a  necessary  and  suf¬ 
ficient  condition  for  a  pair  of  operations  to  be 
gigantic. 

Next,  we  constructed  a  gigantic  pair  for  each 
k  that  is  not  a  power  of  2.  This  is  a  generaliza¬ 
tion  of  the  results  of  Szabo  and  Czedli.Thus,  we 
now  know  that  a  gigantic  pair  exists  for  every  k 
withfc#2m(m>2). 

The  remaining  case  k  =  2 m  (m  >  2)  has  to 
be  considered  separately.  Notice  that  the  mini¬ 
mal  operation  g  can  be  neither  a  majority  oper¬ 
ation  nor  a  quasiprojection,  because  if  it  is  one 
of  them  then  each  two-element  subset  of  k  is  a 
subuniverse  of  <  fc ;  g  >  which  contradicts  to 
(iii).  For  simplicity,  identify  k  with  2m  and  set 
f(x)  «  x  ®  e  (where  ©  is  the  mod  2  componen¬ 
twise  sum  on  2m  and  e  :=  (1, . . . ,  1)).  Then  the 
operation  g(x ,  y,  z) x  ©  y  ©  2  does  not  satisfy 
(vii)  and  so  g  is  not  of  this  type.  Thus,  if  a  gi¬ 
gantic  pair  (/,  g)  exists  then  g  must  be  binary 
and  idempotent.  Unfortunately,  due  to  (iii)  the 
operation  g  cannot  be  a  semilattice  join. 

NOTE:  After  the  completion  of  this  paper,  the  au¬ 
thors  were  informed  that  the  existence  of  gigantic 
pairs  for  every  A;  >  1,  including  k  being  a  power 
of  2,  was  established  by  G.  Czedli,  R.  HalaS,  K.A. 
Kearnes,  P.P.  Palfy  and  A.  Szendrei[CHKPS].  Their 
work  and  ours  were  carried  out  independently  and 
in  parallel. 
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Maximal  chains  of  partial  clones  containing  all  idempotent  partial 

functions 

Lucien  Haddad  and  Jean  Fugere  * 


Abstract 

Let  k>  2  and  k  be  a  k-element  set  We  show  that  ev¬ 
ery  partial  clone  containing  all  idempotent  partial  func¬ 
tions  on  k  is  finitely  generated.  Moreover  we  construct 
maximal  chains  of  length  2k  of  such  partial  clones. 


1  Preliminaries 

Let  k  >  2  and  k  :=  —  1}.  For  a  positive 

integer  n,  an  n-ary  partial  function  on  k  is  a  map  /  : 
dom  (/)  -4  k  where  dom  (/)  is  a  subset  of  kn  called  the 
domain  of  /.  Denote  by  Par^(k)  the  set  of  all  n-ary 
partial  functions  on  k  and  let  Par(k)  :=  (J  Par(n)(k). 

nG  N 

A  partial  function  g  G  Par^(k)  is  a  subfunction  of 
/  G  Par^(k)  (in  symbols  g  <  f)  if  dom  (g)  C  dom  (/) 
and  g{a)  =  f(a)  for  all  a  g  dom  (g).  Moreover  let 

Op(n)  (k)  :=  {/  6  Par(n)(k)  |  dom  (/)  =  kn)  and 
Op(k)  be  the  set  of  all  total  (or  everywhere  defined) 
functions  on  k,  i.e., 

Op(k)  :=  [J  Op(n)(k)  . 

n€N 

In  the  sequel  we  will  say  function  for  total  function. 
We  define  the  superposition  of  partial  functions  as  fol¬ 
lows.  For  n,m  >  1,  /  €  Parn^(k)  and  pi,...,pn  € 
Par^(k),  the  superposition  of  /  and  pi,...,pn,  de¬ 
noted  /[pi, . . .  ,pn],  is  the  m-ary  partial  function  h  on 
k  defined  by 

n 

dom  ( h )  :=  {a  e  km  |  a  e  P|  dom  (gi)  and 

i=l 

(ffi(S).---.fln(g))6dom(/)}; 

and 

h(a)  :=  f(gi(a),...,gn(a)) 
for  all  a  e  dom  ( h ). 
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For  every  positive  integer  n  and  every  1  <  i  <  n, 
let  ef  denote  the  n-ary  i-th  projection  defined  by 
ef  (oi, . . . , an)  -  ai  for  all  (oi, . . . , an)  G  kn,  (notice 
that  e”  is  a  total  function).  Furthermore  let 

J(k)  :=  {e”  |  1  <  i  <  n  <  oo} 

be  the  set  of  all  projections  on  k. 

Definition  1.  A  partial  clone  on  k  is  a  composition 
closed  subset  of  Par(k)  containing  the  set  of  all  pro¬ 
jections  J(k).  A  partial  clone  C  contained  in  the  set 
of  all  functions  Op(k)  is  called  a  clone  on  k.  A  par¬ 
tial  clone  C  is  strong  if  it  contains  all  subfunctions  of 
its  functions;  i.e.,  if  for  every  /  G  C  and  g  G  Par(k), 
g  <  f  g  G  C.  Following  [2],  we  define  the  strong 
closure  Str  (C)  of  the  partial  clone  C  as  the  smallest 
strong  partial  clone  containing  C.  Note  that 

Str  ( C )  :=  {g  G  Par(k)  |  g  <  f  for  some  /  G  C). 
Clearly  a  partial  clone  C  is  strong  if  and  only  if  C  — 
Str  (C7).  It  is  shown  in  [2]  that  a  partial  clone  on  k  is 
strong  if  and  only  if  it  contains  the  set  Str  (J(k))  of  all 
partial  projections  on  k. 

Let  h  >  1,  let  p  be  an  h-ary  relation  on  k  and  let 
/  be  an  n-ary  partial  function  on  k.  We  say  that 
/  preserves  p  if  for  every  h  x  n  matrix  M  =  [ Mij ] 
whose  columns  M+j  G  p,  (j  =  l,...n)  and  whose 
rows  Mi*  G  dom  (/)  ( i  —  l,...,h),  the  h- tuple 
(/(Mi*),...,/(M/l*))  G  p.  Set  pPol(p)  :=  {/  G 
Par(k)  |  /  preserves  p}  and  Pol  p  :=  pPol  p  0  Op(k). 
Let  n  >  1  and  /  G  Par^(k).  We  say  that  /  is  idem- 
potent  if  for  all  x  G  k, 

(a;, . . .  ,x)  e  dom  (/)  =>•  f{x,  ...,x)  =  x. 

Notice  that  1  :=  n  pp<>1  {x}  is  the  set  of  all  idempo- 

tent  partial  functions  and  Q  Pol  {a;}  is  the  set  of  all 

idempotent  functions  on  k.  We  refer  the  reader  to  [4] 
for  examples  and  more  details. 

In  this  paper  we  carry  on  the  study,  initiated  in  [4], 
of  the  partial  clones  containing  all  partial  idempotent 
functions  on  k.  We  show  that  every  partial  clone  that 
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contain  the  set  of  all  partial  idempotent  functions  is 
finitely  generated.  Moreover,  we  construct  families  of 
maximal  chains  of  such  partial  clones  of  size  2k. 

The  counterpart  for  clones  of  the  result  we  want  to 
establish  for  partial  clones  is  known.  We  will  use  the 
following  result  due  to  R.  Quackenbush. 


2  New  results 

2.1  In  this  first  subsection  we  show  that  each  of 
the  finitely  many  partial  clones  on  k  that  contain  X 
is  finitely  generated.  We  start  with 

Lemma  4  The  partial  clone  X  is  finitely  generated. 


Proposition  1  ([6])  Let  k  >  2.  Then  the  clone 
P|  Pol  {x}  of  all  idempotent  functions  on  k  is  gen- 

xGk 

erated  by  all  binary  idempotent  functions.  ■ 

The  concept  of  “separating  clone”  defined  in  [2]  and 
[3]  will  be  used  to  show  one  of  our  main  results. 

Definition  A  clone  C  on  k  is  separating  if  there 
exists  m  >  1  such  that  for  all  n  >  1  and  all  b  e  kn, 
there  exist  €  C  fi  Op^(k)  such  that  for 

every  a  e  kn, 

(^l(S)j  *  *  •  J0m(®))  =  (Pl(£)j  •  *  *  i9m{b))  < — >  2  ~ 

We  employ  the  following  result  from  [2]  (see  also  [3]). 

Proposition  2  Let  k  >  2  and  C  be  a  clone  on  k. 
Then  the  partial  clone  Str  (C)  is  finitely  generated  if 
and  only  if  C  is  a  finitely  generated  separating  clone,  u 

A  description  of  all  partial  clones  containing  all  idem- 
potent  partial  functions  is  given  in  [4].  We  will  use 
the  following  notation  introduced  in  [4].  For  A  := 
{ai,..., am}  C  k  with  a\  <  ...  <  am,  denote  by 
Va  the  partial  clone  determined  by  the  m-ary  relation 
{(ai, . . . ,  cim)}  ) 

Va  ==  pPol  {(ax, .... om)}  =  |J  U  e  Par(n)(k)  I 

neN 

[(ax, . . . ,  a*)  e  dom  (/)  V  i  =  1, . . . ,  m]  => 
ffaii  ■  *  • )  Ui)  =  y  i  1, •  •  • , 

For  instance  if  T  =  {{0},  {0, 1, 2}},  then 

H  Px=pPol{0}npPol{(0,l,2)}. 

We  have 

Proposition  3  ([4])  Let  k  >  2.  There  are  finitely 
many  partial  clones  on  k  that  contain  the  set  X  of  all 
idempotent  partial  functions  on  k.  Furthermore,  a  par¬ 
tial  clone  C  on  k  contains  X  if  and  only  if  there  is  a 
family  T  of  subsets  of  k  such  that  C  =  n*-  ■ 

X<=T 


Notice  that  Par(k)  =  n  pPol  {*}. 

x€0 


Proof.  We  show  that  P|  Pol  {x}  satisfies  the 

s€k 

conditions;  in  Proposition  2  and  moreover  that 
X  =  Str  (P|  Pol  {a;}).  As  mentioned  in  the  section 

a?6k 

above,  the  fact  that  P|  Pol  {x}  is  a  finitely  generated 

xek 

clone  is  a  consequence  of  [6],  One  may  also  prove 
this  result  by  using  the  Baker-Pixley  criterion  which 
states  that  every  clone  that  contain  a  near-unanimity 
function  is  finitely  generated  (see  [1]  and  [3]).  Now  it 
is  easy  to  see  that  P|  Pol  {x}  contains  the  ternary 
xek 

near-unanimity  function  g  defined  by  g(x i,X2,#3)  =  y 
if  at  least  two  of  xi,X2,#3  are  equal  to  y  and 
g(x x,X2,Xz)  =  0  if  X\  ^  x2  #  £3  i1  Next  we 
show  that  P|  Pol  {x}  is  a  separating  clone  on  k. 
®€k 

Choose  m  =  2  and  let  n  >  1  and  b  e  kn.  Denote 
by  the  n-ary  diagonal  relation  on  k  defined  by 
u/n)  :=  {(x, . . .  ,x)  |  x  €  k}.  We  distinguish  two  cases. 


Case  l:  be  u>(n).  Then  b  =  (6, . . . ,  b)  for  some  b  £  k. 
Choose  c  e  k  \  {6}  and  define  the  n-ary  function 


ei  (S)?  if  £  € 
c,  otherwise, 


note  that  g~x(b)  =  {(6,..., b)},  and  consequently 

(»(a).ff(2))  =  (ff(4).ff(A))  *=>  2  =  vaek". 

Case  2:  b  £  u>(nK  Define  the  two  n-ary  functions  g\ 
and  02  by 


{e™ (x),  if  x  e 

if  *  = 

1,  otherwise, 

and 

f  e”(£)>  if  £  €  w(n) 

safe)  =  \  !>  if  *  =  b 

^  0,  otherwise, 

Thus  {91(b), g2(b))  =  (0,1).  Let  a  e  kn,  a  ±  b.  Then 
0i («)  =  9i(b)  =►  a  =  (0,...,0)  and  so  g2(a)  =  0  # 
92(b)  =  1. 
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Finally  we  show  that  X  =  Str  (P|  Pol  {#}).  Let  /  £ 

a€k 

X  \  Pj  Pol  {a;}  be  n-ary  and  define  the  function  g  by 

reek 

setting 

{/(£),  if  x  e  dom  (/) 

e?fe),  if  £  e  w(n)  \  dom  (/) 

0,  otherwise. 

Let  x  =  arn)  £  u;(n).  If  «  €  dom  (/),  then 

p(x)  =  /(x)  =  aq  since  /  £  Z.  Else  if  £  ^  dom  (/), 
then  g(x)  =  aq  by  definition,  thus  g(x)  =  aq  for  all 

x  e  a/n\  i.e.,  5  G  P|  Pol  {a:}  and  by  definition  g  is  an 

®€k 

extension  of  the  function  /.  ■ 

Combining  the  above  lemma  with  Proposition  3  we  get 

Corollary  5  Let  k  >  2.  Every  partial  clone  that  con¬ 
tains  the  set  X  of  all  partial  idempotent  functions  is 
finitely  generated. 

2.2  In  this  second  subsection,  we  construct  maximal 
chains  of  length  2k  of  partial  clones  containing  the  set 
X  on  k.  We  need  the  following  terminology:  if  C\  and 
C2  are  two  partial  clones,  we  say  that  C2  covers  C\  if 
Ci  C  C2  and  the  inclusions  C\  C  D  C  C2  hold  for  no 
partial  clone  D  on  k.  Moreover  we  say  that  the  chain 
of  partial  clones 

Ci  C  C2  c  . . .  C  Cfi~  1  c  Cn 
is  a  maximal  chain  if  the  partial  clone  Ci  covers  the 
partial  clone  Ci- 1  for  all  t  =  2, . . . ,  n.  We  start  with 
the  following  general  result. 

Lemma  6  Let  Ci  and  C2  be  two  partial  clones  on  k 
with  Ci  C  Suppose  that  there  is  a  subset  D  of  k 
and  a  £  k  \  D  such  that 
1)  P  pPol  {a;}  C  Ci  C  C2  and 

x€D\J{a} 

2)  for  every  n>  1  and  every  f  £  Par^  (k) , 

/  £  C^  \  c if  and  only  if  the  following  two  condi¬ 
tions  hold: 

i)  V  x  £  D  U  {a}  (*,...,*)€  dom  (/),  and 

ii)  V  d  G  D  f(d , . . . ,  d)  =  d,  and 

iii)  /(a, . . . ,  a)  ^  a. 

Then  the  partial  clone  C2  covers  the  partial  clone  C\. 


then  P|  pPol  {x}  =  pPol  {a}.  As  pPol  {a}  is  a 

x€Du{a} 

maximal  partial  clone  on  k,  we  deduce  in  such  a  case 
that  Ci  =  pPol  {a}  and  C2  =  Par(k).  Let  m  >  1 
and  f0  G  62  \  Ci  be  m-ary.  Then  by  assumption 
p  :=  {(d, . . . ,  d)  |  d  €  D}U{(a, . . . ,  a)}  C  dom  (/0).  We 
show  that  C2  =  (Ci  U  {/o})*  Since  C\  C  (Ci  U  {/o}), 
we  have  that  (Ci  U  {/o}>  is  a  strong  partial  clone 
and  so  it  contains  the  partial  function  fi  :=  /o|p- 
Consequently  the  unary  partial  function  /  defined  by 
/  :=  belongs  to  (Ci  U  {/0}>.  Notice 

that  dom  (/)  =  D  U  {a}  and  as  /0  satisfies  the  con¬ 
ditions  ii)  and  iii)  above,  f(d)  =  d  for  all  d  £  D  and 
f(a)  ^  a.  Now  let  g  G  C2  \  Ci  be  n-ary.  We  show  that 
g  £  (Cl  U  {/o}>.  As  k  >  2,  we  may  assume  w.l.o.g. 
that  a/0.  Define  three  n-ary  partial  functions  go>ga 
and  g  by  setting 

dom  (00)  =  dom  (ga)  =  dom  (g)  -  dom  (g)  and, 
for  y  £  {0,  a} 

{a,  if  x  =  (a, . . . ,  a) 

d,  if  x  =  (d, . . . ,  d)  and  d  G  D 
y  otherwise, 


while 

-(x\  _  I  °>  if  *  =  (“>•••> a) 

1  g(x)  otherwise. 

Notice  that  from  g  €  C2  \  C\ ,  we  have  g(d,  ...,d)  =  d 
and  thus  g(d, . . . ,  d)  =  d  for  all  d  €  D.  Now  since 
h{x,...,x)  =  x  for  all  x  €  D  U  {a}  and  all  h  € 
{fto, 5<», g},  we  have  {go,ga,g}  C  n  pPol  {x}  C 

i6DU{a} 

Cl,  hence  {go,9a,g}  C  C\.  _ 

Set  /  :=  /[go]-  Then  /  G  (Ci  U  {/0}),  dom  (/)  = 
dom  (g)  and 


7/  \  if  £  =  (d, . . . ,  d)  and  d  €  D 

J  w  —  1  7(a)  otherwise 

We  need  one  more  partial  function.  Let  a  be  the  4-ary 

partial  function  defined  by 

dom  (a)  :=  {(a,  a,  f{a), a)}  U  {(d,d,d,d)  \  d£  D) 

U{(0,a,/(a),x)  |  x  6  k}, 

and  for  x  :=  (xi,X2,X3,X4), 


{g(o, . . . ,  a),  if  x  =  (a,  a,  f(a),a) 
d,  if  x  =  (d, . . . ,  d)  and  d  €  D 
x,  if  x  =  (0,  a,  f(a),  x)  and  x  €  k 


Proof.  First  notice  that  since  n  ppd  m  e 

xeDU{a} 

Ci  C  C2,  both  Ci  and  (72  contain  the  set  Str  (J(k)) 
of  all  partial  projections  on  k,  and  consequently  they 
are  both  strong  partial  clones.  Furthermore,  if  D  =  0, 


As  f(a)  ^  a,  we  have  (a,  a,  a,  a)  £  dom  (a)  and  more¬ 
over  it  is  clear  that  (x,x,x,x)  G  dom  (a)  x  £  D. 
Combining  this  with  a(d,  d,  d,  d)  =  d  for  all  d  £  D,  we 
get  that  a  £  n  pPol  {#}  C  Ci,  thus  a  £  Ci.  Now 

xeDU{a} 
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we  show  that  g  =  a[gQ,ga,  f,g].  Set  7  :=  a[g0,ga,  f,  gl¬ 
ia  view  of  dom  (go)  =  dom  (g),  we  have  dom  (7)  C 
dom  (g).  Conversely,  using  the  definitions  of  go ,  ga ,  f 
and  g ,  we  can  verify  that 

(9o(x),ga(x)J(x),g(x))  €  dom  (a) 
for  all  x  £  dom  ( g ),  thus  dom  (7)  =  dom  ( g ).  Let 
x  e  dom  (7).  Then 

{a(a, a,  /(a), a)  =  g(a , . . . , a)  if  x  =  (a, . . . , a) 
a(d,  d,  d,  d)  =  d,  if  x  =  (d, . . . ,  d)  and  d  £  D 
a(0,a}/(a),<j(£))  =  <?(£)  otherwise 

and  so  by  the  definition  of  q ,  <7(2: )  =  7(0:)  for  all  £  £ 
dom  (; g ),  i.e.,  0  =  7  €  (Ci  U  {f0}).  ■ 


We  now  construct  maximal  chains  of  length  2k  of  par¬ 
tial  clones  containing  the  set  I  of  all  partial  idempotent 
functions  on  k.  We  will  use  the  following  notation.  If 
C  and  Cf  are  two  partial  clones  on  k  with  C  C  C', 
then  the  interval  of  partial  clones  [C,  C']  is  defined  by 
[Cy  C)  =  {D  |  D  is  a  partial  clone  on  k 

and  C  CD  C  C'}. 
We  start  with  an  example  to  illustrate  the  general  con¬ 
struction.  Let  k  =  4.  For  notatioiial  ease  set 
C4  :=  p|  Vx  and 

*<E{{0},{1},{2},{3}} 

Cz  :=  P|  Vx, 

*6{{0},{1},{2}} 

therefore  C3  is  a  partial  clone  on  {0, 1,2,3}  that  con¬ 
tain  the  set  of  all  partial  idempotent  functions  C4  on 
{0, 1, 2, 3}.  We  show  that  the  interval  of  partial  clones 
[C^Cz]  contains  several  unrefinable  chains  of  partial 
clones  of  length  8  on  {0, 1, 2, 3}.  Set 
T\  :=  {{0},  {1},  {2},  {0, 1,2, 3}},  F2  :=  Fx  U  {1, 2, 3}, 
Tz  :=  F2  U  {0, 2, 3},  F4  :=  Fz  U  {0, 1, 3}, 

Fs  :=  F4  U  {2, 3},  Fe  :=  Fz  U  {1, 3},  F7  :=  F6  U  {0, 3} 
and  T%  :=  F7  U  {3}. 

The  equality  C4  =  P|  Vx  is  straightforward.  Con- 
XGTe 

sider  the  chain  of  partial  clones 
Ci  C  p|  Vx  c  f|  PxC-C 

xeFi  xefe 

P  Vx  C  P|  Vx  C  Cz-  (★) 

X£T2  XZFi 

It  is  easy  to  verify  that  Lemma  6  applies  to  each  pair 
of  consecutive  partial  clones  in  the  chain  (★).  Consider 
for  example  the  two  partial  clones 
P  Vx  and  P  Vx-  Then  clearly 

xer3  xer2 

P  pPol  {#}  C  p  Vx  and  for  any  n-ary  partial 
£€{0,2,3}  xer3 


function  f,  f  6  p|  Vx  \  p|  Vx  iff 

(0,  •  •  • ,  0),  (2, . . . ,  2),  (3, . . . ,  3)  €  dom  (/)  and 

/(0 . 0)  =  0,  f(2, ...  ,2)  =  2  and  /(3,...,3)  #  3. 

So  take  D  :=  {0, 2},  a  =  3  and  apply  Lemma  6. 

We  now  turn  to  the  general  case. 

Let  2  <  t  <  k  -  1.  Put  Ct  :=  P  Vx 

X€{{0},...,{*-1}} 

and  Ct+X  P  Vx-  We  show  that  the 

:  X€{{0},...,{t-l},{t}} 

interval  of  partial  clones  [C*+ 1,  Ct]  contains  maximal 
chains  of  size  2*.  Put 

==  {{0}, . . . ,  {t  -  1},  {0, 1, . . . ,  t-  1,  t}} 

and  consider  the  partial  clone  Cx,t+X  :=  P  Vx- 

X6Fi,t+i 

Then  an  n-ary  partial  function  /  satisfies 

fect\cu+1 

if  and  only  if  /  G  pPol  {0}  D  . . .  n  pPol  {t  —  1}  and 
/  £  pPol  {(0, -  1, t)}.  This  holds  if  and  only 
if  {(0,...,0),  C  dom  (/),  /(d,...,d)  = 

d  for  all  d  £  {0, . . . ,  t  —  1}  and  f(t , . . . ,  t)  ^  t.  By 
Lemma  6  the  partial  clone  Ct  covers  CXlt+i  (here  D  = 
{0, . . . ,  £  —  1}  and  a  =  £). 

Starting  with  FXjt+X,  we  define  recursively  2l  families 
of  subsets  of  t  -f  1  :=  {0, 1, . . . ,  t  -  1,  t}  as  follows:  in 
the  first  step  we  add  to  Fij+X  one  by  one  all  subsets  of 
the  set  t  -f  1  that  are  of  size  t  and  contain  the  element 
t.  More  precisely,  let  YXit , . . . ,  Yt,t  be  a  list  of  all  subsets 
of  t  +  1  of  size  t  and  containing  the  element  t.  Set 

F\st  :=  u  {Yi,t},  -  -  -  ,Ft,t  -=■  Ft-Xit  U  {Yt,t}- 

Next  let  1  <  j  <  t  —  1  and  suppose  that  each  of 
Fxj+x, . . .  is  defined.  We  use  the  family  of 

all  subsets  of  t  +  1  of  size  j  that  contain  the  element 
t.  There  are  such  sets,  let  YXj,.. .  ,Y^  j  be  a 
list  of  them.  Set 

In  this  process,  the  second  to  the  last  family  of  subsets 
to  be  defined  is  Ft ,2,  it  actually  consists  of  all  subsets 
of  t  +  1  of  size  at  least  2  and  that  contain  the  element 
t.  Finally  Fx,x  is  defined  as  the  family  of  all  subsets  of 
t  +  1  that  contain  the  element  t. 

For  every  j  =  1, . . . ,  t  and  every  i  =  1, . . . ,  ( ,  put 

c,.t-  n  Vx-  Clearly  the  family  of  partial  clones 

xefij 

{Cij}j= i  t  i= i  ( .*  )  is  contained  in  the  interval 

[Ct- |-i,  Ct ].  We  show  that  it  forms  a  maximal  chains  of 
partial  clones.  To  see  this  let  j  =  2, . . . ,  t  -  1  and  con¬ 
sider  the  two  partial  clones  C^  and  CXj.  We  use 
Lemma  6  to  show  that  Cfty.+1  covers  CXj.  W.l.o.g. 
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suppose  we  chose  the  set  Yi  j  to  be  {0, . . . ,  j  —  2,  /,}. 
Clearly  statement  1)  of  Lemma  6  holds  for  Cij  and 
<?(<)  J+i>  with  L>  =  {0, . . .  ,j  -  2}  and  a  =  t.  Now  let  / 

be  an  n-ary  partial  function.  Then  /  £  \  C\  j 

if  and  only  if  it  preserves  each  j- ary  relation  on  k 
of  the  form  {(ai, ....  «j-i ,  t)}  (where  ai ,  •  ■  ■ ,  a7-i  € 
{0, ...,t  —  1}  are  pairwise  distinct),  and,  moreover  / 
does  not  preserve  the  relation  {(0, . . . ,  j  —  2,f)}.  This 
holds  if  and  only  if 

i)  V  x  €  {0, . . . ,  j  -  2}  U  {t}  (»,...,*)€  dom  (/),  and 

ii)  V  d  €  {0, . . .,}  -  2},  f(d,  ...,d)=d,  and 

iii)  ±  t. 

By  Lemma  6,  Cm  J+1  covers  the  partial  clone  C\,j. 
We  leave  to  the  reader  the  proof  that  the  partial 
clone  Ci+ij  covers  the  partial  clone  Cij  for  every 
j  =  1, . . .  ,t  and  every  i  =  1,...,  (*)  -  1.  This  proof  is 
similar  to  the  one  given  above,  however  here  one  has 
to  specify  the  choices  of  the  sets  Yej  foralll  =  l,...,t. 

Putting  together  the  chains  thus  constructed  in  each 
of  the  intervals  of  partial  clones  [Ct+i ,  Ct\  for  all  2  < 
t  <  k  —  1,  we  get  a  maximal  chain  of  length  2*-1  + 
2*~2  + . . .  +  2.  On  the  top  of  such  a  chain  one  adds  the 
maximal  chain  pPol  {0}  C  Par(k)  to  obtain  a  maximal 
chain  of  length  2*  of  partial  clones  containing  the  set 
of  all  idempotent  partial  functions  1  on  k. 
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2nd  edition  (1984)  150-192. 


Remark  Very  recently,  D.  Lau  provided  the  first 
author  with  a  proof  of  the  following  result:  there  are 
finitely  many  partial  clones  (strong  or  not)  that  con¬ 
tain  the  clone  Q  Pol  {*}  of  all  idempotent  functions 

z€k 

on  k.  As  P|  Pol  {x}  is  a  finitely  generated  clone,  it 

a:€k 

follows  that  every  partial  clone  containing  P|  Pol  {x} 

is  a  finitely  generated  partial  clone.  Corollary  5  is  a 
particular  case  of  this  result. 
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Partial  Clones  and  their  Generating  Sets 


Lucien  Haddad  *  Dietlinde  Lau  ** 


Abstract 

We  present  some  of  our  recent  results  on  partial  clones. 
Let  A  be  a  non  singleton  finite  set.  For  every  maximal 
clone  C  on  A,  we  find  the  maximal  partial  clone  on  A 
that  contains  C.  We  also  construct  families  of  finitely 
generated  maximal  partial  clones  as  well  as  a  family 
of  not  finitely  generated  maximal  partial  clones  on  A. 
Furthermore ,  we  study  the  pairwise  intersections  of  all 
maximal  partial  clones  of  Shipecki  type  on  A. 


1  Introduction  and  Preliminaries 


In  the  sequel  k  >  2  and  A  is  a  fc-element  set.  For  a 
positive  integer  n,  an  n-ary  partial  function  on  A  is  a 
map  /  :  dom  (/)  -t  A  where  dom  (/)  is  a  subset  of  An 
called  the  domain  of  f.  Let  V ^  denote  the  set  of  all 
n-ary  partial  functions  on  A  and  let  Va  :=  U  V^\ 


n>  1 


Moreover  set  O ^  :=  j/  G  V ^  |  dom  (/)  =  An  j  and 
Oa  •=  U  O^a*  •  O a  is  the  set  of  all  functions  on  the 


n>l 


set  A. 

For  n,m  >  1,  /  G  V^  and  gi,...,gn  €  V^\  the 
composition  of  /  and  <?i, . . .  ,  denoted  f\gi, . . .  ,gn] 
is  the  m-ary  partial  function  on  A  defined  by 

n 

dom  (f[gi, p„])  :=  {a  G  Am  \  a  G  Q  dom  (g{)  and 


i=  1 


and  moreover 


(; 9i(a ),  •  •  • ,gn{3 ))  G  dom  (/)}; 


f[9i,  ■  •  •  ,9n](a)  f  (91(a), . .  .,gn(a)) 

for  all  a  G  dom  (f[gi 

For  every  positive  integer  n  and  each  1  <  i  <  n,  let  e? 
denote  the  n-ary  i-th  projection  defined  by  dom  (ef)  = 
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An  and  e”(ai, . . . , an)  =  ai  for  all  (oi, . . . , an)  €  An. 
Furthermore  let 

Ja  :=  {e”  |  1  <  %  <  n  <  00} 
be  the  set  of  all  projections. 

Definitions  A  partial  clone  on  A  is  a  composition 
closed  subset  of  Va  containing  Ja •  If  a  partial  clone 
C  is  contained  in  the  set  of  all  functions  Oa  >  then  it  is 
called  a  clone  on  A. 

A  partial  clone  (a  clone  )  C  is  said  to  be  a  maximal 
partial  clone  ( maximal  clone)  if  C  C  Co  C  Va  (C  C 
Cq  C  Oa)  for  no  partial  clone  Co  (for  no  clone  Co)  on 
A. 

I.  G.  Rosenberg  described  all  maximal  clones  on  A  in 
[21],  (see  also  [22]  and  [23]).  The  description  is  in  terms 
of  polymorphisms  of  relations,  these  are  defined  as  fol¬ 
lows. 

Let  h  >  1,  g  be  an  h- ary  relation  on  A  (i.e.,  a  subset  of 
Ah)  and  /  be  an  n-ary  partial  function  on  A.  Denote 
by  ,M(dom  (/),  q)  the  set  of  all  h  x  n  matrices  M  on 
A  whose  columns  M*j  G  g,  for  j  =  1, . . . ,  n  and  whose 
rows  Mi*  €  dom  (/)  for  i  =  1, . . . ,  h.  We  say  that  the 
partial  function  /  preserves  the  relation  g  if  for  every 
M  €  .M(dom  (f),e),  (f(Mu),...,f(Mh.))  G  g. 

Set  pPol  q  :=  {/  G  Va  \  f  preserves  q)  and  Pol  g  := 
pPol  g  fl  Oa  (he.,  Pol  g  is  the  set  of  all  functions  that 
preserve  the  relation  g).  It  is  well  known  and  easy  to 
show  (see  e.g.,  [12],  [17]  and  [21])  that  pPol  g  (Pol  g) 
is  a  strong  partial  clone  on  A  (a  clone  on  A)  called  the 
partial  clone  determined  by  g  (the  clone  determined  by 
Q )• 

A  partial  clone  C  is  said  to  be  strong  if  it  contains  all 
subfunctions  of  its  functions,  i.e.,  if  for  every  /  G  C 
and  g  G  VA ,  [5  =  /Idem  (g)  =>  9  €  C]. 

For  every  set  F  C  Va,  let  Str  ( F )  denote  the  strong 
closure  of  Fy  i.e., 

Str  (F)  :=  {g  £  Va  |  9  is  a  subfunction  of  some  /  G  F} 
Furthermore  let  ( F )  denote  the  partial  clone  generated 
by  F,  i.e.,  the  intersection  of  all  partial  clones  on  A 
containing  the  set  F.  If  C  =  (F),  then  we  say  that  F 
is  a  generating  set  for  C.  A  partial  clone  C  is  said  to 
be  finitely  generated  if  it  admits  a  finite  generating  set; 
i.e.,  if  C  =  (F)  for  some  finite  set  F  C  C.  Finally,  a 
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partial  clone  C  is  called  of  Shipecki  type  if  C  contains 
the  set  O ^  of  all  unary  functions  on  A. 

The  purpose  of  this  paper  is  to  present  the  recent  re¬ 
sults  obtained  by  the  two  authors  in  [6]  and  [7].  As 
many  of  the  results  in  [6]  are  based  on  Rosenberg’s 
classification  of  all  maximal  clones  on  A,  we  need  to 
state  Rosenberg’s  Theorem.  Let  ft  >  2  be  an  integer. 
Put 

Th  :=  {(ai,  •  •  •  >a/0  €  Ah  |  a,i  =  dj 

for  some  1  <  i  <  j  <  ft} 
If  q  is  an  ft- ary  relation  on  A  and  Sh  denotes  the  set  of 
all  permutations  on  {1, . . . ,  ft},  then  for  i t  E  Sh  let 

:=  {(*,(1),..., *»(*))  |  (xi,...,Xh)  €  p}. 

An  ft- ary  relation  q  is  said  to  be 

1.  totally  symmetric  ( symmetric  in  the  case  ft  =  2)  if 
qW  =  q  for  every  7r  £  Sh  i*e.,  if 

(#1, .  . .  ,£/i)  £  Q  > — V  (^tt(I) j  •••) xn(h))  €  Q 

for  all  7r  6  Sh- 

2.  totally  reflexive  ( reflexive  in  the  case  ft  =  2)  if 
Th  Q  Q  * 

3.  prime  affine ,  if  ft  =  4  and  there  are  a  prime  number 
p  and  an  integer  m  >  1  such  that  k  —  pm  and 

£  :=  {(fli,  cb2->  035^4)  ^  A4  |  a\  +  &2  =  &3  +  fl4}> 
where  (A, +)  is  an  elementary  Abelian  p-group  (i.e., 
x  +  x  +  ...  +  x  =  0  for  every  element  x  €  A,  where  0  is 

p  times 

the  identity  element  of  the  group). 

4.  central ,  if  g  ^  Afc,  0  is  totally  symmetric,  totally 
reflexive  and,  if  ft  >  2,  there  exists  an  element  c  £  A 
such  that  {c}  x  Aa_1  C  0.  (The  element  c  is  called  a 
central  element  of  0.) 

5.  elementary ,  if  fc  =  ftm,  m  >  1,  ft  >  3  and 
(ai,  fl2  . . . ,  dh)  £  0  4=^ 

Vi  €  {0,...,m  —  1} 

where  (a  £  {0, 1, . . . ,  ftm  - 1})  denotes  the  i-th  digit 
in  the  expansion 

a  =  •  ft m"1  +  a(m”2)  •  ftm“2  +  . . . 

•  ft  +  a^0^. 

6.  a  homomorphic  inverse  image  of  an  h-ary  relation  gf 
on  A',  if  there  exists  a  surjective  mapping  q  :  A  — >  A! 
with 

(ai , . . . ,  dh)  £  0  > — <  (^(ai)j  •  -  - 1  Q{ah))  €  Q 
for  all  £  A. 

7.  h-universal ,  if  0  is  a  homomorphic  inverse  image  of 
an  ft-ary  elementary  relation. 

Furthermore,  let 

£4  be  the  set  of  all  ft-ary  central  relations  on  A, 
1  <  ft  <  fc  —  1 ; 

Ua  be  the  set  of  all  non  trivial  equivalence  relations 
on  A  ; 


SA  :=  U  Sk,P  ,  where  Sk,p  :=  {{(x,  s(x))  |  x  £ 

pyp\k 

A}  |  s  £  j Pk,p}  and  Pk,p  is  the  set  of  all  permutations 
on  A  consisting  of  cycles  of  the  same  prime  length  p  ; 
Ma  be  the  set  of  all  bounded  partial  order  relations 
on  A; 

d A  be  the  set  of  all  prime  affine  relations  on  A, 

Ba  be  the  set  of  all  ft-universal  relations,  3  <  ft  <  fc. 

Here  is  Rosenberg’s  Theorem. 

Theorem  1  [23]  Let  k  >  2  and  A  be  a  k-element  set 
Every  proper  clone  on  A  extends  to  a  maximal  one .  If 
M  is  a  clone  on  A,  then  M  is  a  maximal  clone  if  and 
only  if  M  —  Pol  q7  for  some  g  £  Ca  U  Ma  V$a  UUa  U 
CaU  Ba-  d 

In  the  sequel  M  stands  for  the  set  of  all  relations  that 
determine  maximal  clones  on  A.  Thus  by  Theorem  1, 
M  :=  CaUMaVUaUSaUCaU  &a- 

More  recently,  the  maximal  partial  clones  on  A  were 
described  by  several  authors.  We  refer  the  reader  to  [6], 
[11],  [12],  [13]  or  [18]  for  a  description  of  all  maximal 
partial  clones  on  A. 

Two  problems  are  addressed  in  [6],  they  both  concern 
the  partial  clones  pPol  g  where  g  £  M  is  one  of  the 
relations  in  Rosenberg’s  Theorem.  First,  is  pPol  g  a 
maximal  partial  clone  on  A  and  if  not  which  maximal 
partial  clone  on  A  contain  Pol  0?  Second,  is  pPol  0  a 
finitely  generated  partial  clone  on  A? 

In  [7],  intersections  of  maximal  partial  clones  of 
Slupecki  type  on  A  are  studied.  It  is  shown  that  with 
one  exception,  the  intersection  of  two  strong  maximal 
partial  clones  of  Slupecki  type  on  A  is  covered  by  both 
those  two  maximal  partial  clones.  Furthermore,  it  is 
shown  that  the  situation  is  quite  different  if  the  unique 
non-strong  maximal  partial  clone  of  Slupecki  type  on 
A  is  involved  in  the  intersection. 

2  Partial  clones  containing  maximal 
clones 

Unless  otherwise  specified,  all  results  presented  in  this 
section  are  established  in  [6].  Let  M  be  a  maximal 
clone  on  A.  Since  every  partial  clone  on  A  extends 
to  a  maximal  one  (see  [10]),  it  is  natural  to  ask  how 
many  maximal  partial  clones  on  A  contain  the  clone 
M?  The  following  is  shown  using  a  result  established 
by  Freivald  in  [5]: 
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Proposition  2  Let  M  be  a  maximal  clone  on  A.  Then 
M  is  contained  in  exactly  one  maximal  partial  clone  on 
A.  □ 

It  follows  that  for  every  g  £  M,  either  pPol  £  is  a  max¬ 
imal  partial  clone  on  A  or  there  is  a  unique  maximal 
partial  clone  on  A  that  contains  pPol  g.  The  result  be¬ 
low  shows  that  of  the  six  families  of  relations  that  deter¬ 
mine  maximal  clones,  four  determine  maximal  partial 
clones  on  A. 

Proposition  3  Let  M2  :=  Sa\Su,2  and  g  £  M\(£yiU 
M2).  Then  pPol  g  is  a  maximal  partial  clone  on  A. 
Furthermore ,  if  k  =  2m  for  some  m  >  1,  then  pPol  g 
is  a  maximal  partial  clone  on  A  for  every  g  £  £2m .  □ 

The  relations  in  Sk,p  U  Ca  determine  maximal  clones 
but  do  not  determine  maximal  partial  clone  on  A .  Now 
given  g  £  Sk,p  U  Ca,  one  can  find  the  unique  maximal 
partial  clone  on  A  that  contain  pPol  g.  Starting  with 
g8  :=  {(xy  s(x))  |  x  £  A}  £  <Sfc,p,  we  have 

Proposition  4  Let  p  >  3  be  a  prime  num¬ 

ber ,  s  be  a  permutation  on  A  consisting  of  cy¬ 
cles  of  the  same  prime  length  p  and  g6}P  := 
{(a;,s(a;),s2(:z), . . . ,  sp~1(x))  \  x  £  A }.  Then  pPol  g8  C 
pPol  g8iP  and  pPol  gk,p  is  a  maximal  partial  clone  on 
A.  □ 

We  turn  to  the  family  Ca  of  all  prime  affine  relations 
on  A.  Let  A  be  such  a  relation.  The  maximal  clone 
Pol  A  is  described  in  [22]  (see  also  [6]).  As  mentioned 
above,  the  quaternary  relation  A  is  defined  in  terms 
of  a  binary  function  +  on  A  such  that  (A,  +)  is  an 
elementary  Abelian  p-group.  It  is  well  known  that  this 
implies  that 

(A,+)  -  (p,+)x...x(p,+),  where p  :=  {0,...,p-l} 
and  (p,  +)  is  the  cyclic  group  mod  p.  W.l.o.g  assume 
A  =  pm  and  consider  the  p-ary  relation  Ap  defined  on 
Pm  by 

Ap  :=  {(a,  a  +  6,  a  -f-  2  *  6, . . . ,  a  +  (p  —  1)  •  b)  | 

& ?  b  £  pm}, 

then 

Proposition  5  pPol  Ap  is  a  maximal  partial  clone  on 
A  =  pm  that  properly  contains  the  partial  clone  pPol  A. 
□ 

The  rest  of  this  section  is  devoted  to  discuss  generating 
sets  for  the  partial  clones  determined  by  the  relations 
in  the  set  M.  We  refer  the  reader  to  [2]  for  a  brief 
survey  on  generating  sets  for  clones. 

The  second  author  has  shown  in  [14]  that  the  max¬ 
imal  clone  Pol  g  is  finitely  generated  on  A  for  all 


g  £  M  \  Ma-  In  view  of  this  result,  it  is  natural  to  ask 
whether  the  partial  clone  pPol  g  for  g  £  M  is  finitely 
generated.  The  methods  used  to  tackle  this  question 
come  from  [2]  (see  also  [3]).  A  slight  modification  of 
the  concept  of  a  separating  clone,  introduced  in  [2],  is 
given  in  [6]. 

Definition  Let  r  >  1  be  an  integer.  A  clone 
C  on  A  is  an  r- separating  clone  if  for  all  n  >  1, 
there  exist  g\ , . . . ,  gT  £  C  fl  O ^  such  that  the  map 
g  :  An  -*  Ar  defined  by  a  ->  (pi(a), . . .  ,pr(a))  satisfies 
tf""1  (<?(£))  =  {b}  for  all  b  £  An ,  (i.e.,  g  takes  the  value 
g(b)  only  once).  Moreover,  a  clone  is  a  separating  clone 
if  it  is  an  r- separating  clone  for  some  r  >  1  ([2],  [3]). 

Of  the  six  families  of  maximal  clones  on  A,  five  consist 
of  separating  clones.  Let  M\  be  the  set  of  all  binary 
relations  g  on  A  such  that  (A,  g)  is  a  lattice.  We  have 

Lemma  6  For  all  g  £  CaUUaUSaUMa^M^UBa, 
there  is  an  r  £  {1, 2}  such  that  Pol  g  is  an  r -separating 
clone  on  A.  □ 

Remark  The  Lemma  above  leaves  out  the  class  of 
relations  Ca .  In  fact  it  is  shown  in  [6],  Lemma  17,  that 
the  maximal  clone  Pol  g  is  not  a  separating  clone  for 
all  g  £  Ca  • 

The  concept  of  r-separating  clone  is  quite  useful  to  de¬ 
termine  whether  a  partial  clone  of  the  form  Str  C  is 
finitely  generated,  where  C  is  a  clone  on  A.  We  need 
the  following  terminology.  If  a  clone  C  is  finitely  gen¬ 
erated,  then  the  order  of  (7,  written  ord  (C),  is  the 
minimal  positive  integer  r  such  that  (C^)  =  C.  Us¬ 
ing  results  from  [2],  one  can  show 

Lemma  7  Let  r  >  1  and  C  be  a  finitely  gen¬ 

erated  r-separating  clone  on  A.  Then  the  partial 
clone  Str  ( C )  is  finitely  generated  and  ord  (Str  (<7))  < 
maa;{ord  (C),r}.  □ 

Now  in  order  to  apply  the  two  Lemmas  above  on  the 
clones  determined  by  the  relations  in  Rosenberg’s  The¬ 
orem,  one  must  examine  the  relations  g  £  M  for  which 
the  equality  Str  (Pol  g)  =  pPol  g  holds.  Almost  all  of 
the  following  result  is  established  by  Romov  in  [20]. 
However  the  proofs  given  in  [20]  rely  on  relational  con¬ 
structions  as  well  as  on  many  different  results.  Set 
S\  :=  Sk,2  U  Sk, 3-  We  have: 

Lemma  8  (a)  Vp  £  Str  (Pol  g)  = 

pPol  g  and 

(b)  Vee(SA\SX)U{MA\Mi)UCA,  Str  (Pol  e)t 
pPol  g.  □ 
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The  proof  of  Lemma  8  given  in  [6]  is  direct  in  the  fol¬ 
lowing  sense:  it  is  shown  that  for  every  g  €  Ca  U  Ua  U 
S*A  U  M*a  and  every  partial  function  /  €  pPol  g,  f  can 
be  extended  to  a  function  in  Pol  g.  Also,  it  is  shown 
that  for  every  g  €  (Sa  \  SA)  U  {Ma  \  M*A )  U  Ca,  there 
is  a  function  g  €  pPol  g  that  cannot  be  extended  to  a 
function  in  Pol  g. 

We  mentioned  earlier  that  the  second  author  has  shown 
that  five  of  the  six  families  of  relations  in  Rosenberg’s 
Theorem  determine  finitely  generated  maximal  clones 
on  A.  The  class  of  all  bounded  order  is  left  out,  in  fact 
Tardos  has  provided  in  [26]  a  bounded  order  relation 
^  on  an  8-element  set  for  which  the  maximal  clone 
Pol  -<  is  not  finitely  generated.  Nozaki  and  Lashkia 
have  shown  recently  that  a  partial  clone  determined  by 
any  order  relation  is  finitely  generated,  indeed 

Theorem  9  ([15])  Let  g  €  Ma-  Then  the  maximal 
partial  clone  pPol  g  has  order  2.  □ 

Theorem  9  provides  a  family  of  finitely  generated  par¬ 
tial  clones  on  A.  There  are  more  such  families  as  the 
following  result  shows.  For  2  <  h  <  k,  let  CA  denote 
the  set  of  all  fi-ary  central  relations  on  A.  We  have 

Theorem  10  a)  Let  |A|  =  2.  If  g  €  Sa  then 
ord  (pPol^e)  =  3  and  if  g  6  Ca,  then  ord  (pPol^)  = 
2. 

b)  Let  \A\  =  k  >  3. 

i)  If  g  £  UaUS*a\JC1a\JC2a\JM*a,  then  ord  (pPol  g)  = 
2,  and 

ii)  if  3  <  h  <  k-l  and  g  6  C\,  then  ord  (pPol  g)  < 
h,  and 

iii)  if  g  6  Ma  U  Ba,  then  ord  (Str  (Pol  g))  =  2.  a 

Some  of  the  partial  clones  in  Theorem  10  are  maximal. 
In  fact  a  combination  of  Proposition  3,  Theorem  9  and 
Theorem  10  gives 

Theorem  11  Let  k  >  3,  A  be  a  k-element  set  and 
g  eUAVCkUMA-  Then  pPol  g  is  a  finitely  generated 
maximal  partial  clone  on  A.  D 

One  more  family  of  finitely  generated  maximal  partial 
clones  on  A  is  presented  in  [6],  indeed 

Theorem  12  Let  p>  3  be  a  prime  number,  m  >  1,  A 
be  a  pm-element  set,  s  be  a  permutation  with  cycles  of 
the  same  prime  length  p  on  A  and 
gs  :=  {(a,  s(a), . . . ,  sp-x(a))  |  a  €  A}.  Then  the  maxi¬ 
mal  partial  clone  pPol  gs  has  order  2.  □ 

The  problem  of  deciding  whether  pPol  g  is  a  finitely 
generated  partial  clone  for  g  one  of  the  relations  in 
Rosenberg’s  Theorem  is  also  settled  in  [6]  for  the  family 
Ca  of  all  prime  affine  relations  on  A.  Here,  we  make 
use  of  the  following  criterion  established  in  [2]: 


Theorem  13  ([2],  see  also  [3])  Let  A  be  a  non¬ 
singleton  finite  set  and  C  be  a  strong  partial  clone  on 
A.  If  C  fl  Oa  is  not  a  separating  clone,  then  C  is  not 
finitely  generated  on  A.  n 

Now  it  is  shown  in  [6]  that  the  maximal  clone  Pol  A  is 
not  a  separating  clone  for  any  A  C a,  consequently 

Theorem  14  Let  C  be  a  strong  partial  clone  with  C  ft 
Oa  =  Pol  A  for  some  A  6  Ca-  Then  C  is  not  finitely 
generated.  D 

We  point  out  here  that  a  direct  proof  to  Theorem  14, 
without  using  the  criterion  in  Theorem  13,  is  presented 
in  [6]. 

We  conclude  this  section  with  the  following  remark.  A 
result  similar  to  Theorem  14  for  the  family  of  relations 
Ba  is  missing.  Let  g  e  Ba-  It  is  shown  in  [20]  that 
Str  (Pol  g)  /  pPol  g,  and  Lemma  6  shows  that  Pol  g 
is  a  separating  clone.  Thus  neither  Theorem  10  nor 
Theorem  13  applies  for  pPol  g.  We  have  a  feeling  that 
pPol  g  is  a  not  finitely  generated  maximal  partial  clone 
on  A. 

3  Intersections  of  Slupecki  type  maxi¬ 
mal  partial  clones 

In  this  section  we  present  the  results  of  [7].  If  C  and 
C'  are  two  partial  clones  on  A  with  C  C  C1 ,  then  by 
the  interval  of  partial  clones  [C,  C']  we  mean  the  set 
{D  |  D  is  a  partial  clone  on  A  and  C  C  D  C  C'}. 
As  defined  earlier,  a  partial  clone  is  of  Slupecki  type  if 
it  contains  the  set  of  all  unary  functions  on  A. 
For  example,  if  pn  €  V{A]  denotes  the  n-ary  partial 
function  with  empty  domain,  then 

Ha  -=  Cl  a  U  {Pn  I  a  >  1}, 

is  a  Slupecki  type  partial  clone  on  A.  The  definition  of 
a  Slupecki  type  partial  clone  is  motivated  by  the  well 
known  Slupecki  criterion,  which  states  that  Pol  r*  is 
the  unique  maximal  clone  that  contains  the  set  Oa  '  of 
all  unary  functions  on  A  (the  relations  rh  have  been 
defined  in  Section  1).  Burle  improved  this  result  in  [4] 
by  proving  that  the  interval  of  all  clones  of  Slupecki 
type  on  A  is  simply  the  chain 

(0(A})  c  Pol  R\  c  Pol  t3  c  ...  c  Pol  n  c  Oa, 
where 

Ri  =  {( x,y,z,t )  6  A4  |  [x  =  y  and  z  =  t]  or 

[x-z  and  y  =  f]  or  [a;  =  t  and  y  =  z]}. 
As  O ^  is  a  finite  set,  one  can  easily  obtain  (e.g.,  see 
[2]  and  [3])  that  every  clone  of  Slupecki  type  on  A  is 
finitely  generated. 

The  situation  is  quite  different  in  the  partial  case. 
It  is  shown  in  [12]  (and  independently  in  [17])  that 
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there  are  k  +  1  Slupecki  type  maximal  partial  clones 
on  the  ^-element  set  A ,  namely  pPoliii,  pPoli?2, 
pPol  r3, . . . ,  pPol  Tk  and  Ha,  where 
i?2  =  {(x,  V ,  2,  t)  €  A4  |  [x  =  2/  and  z  —  t]  or 

[x  =  £  and  y  =  2]}. 
Notice  that  of  all  these  &  +  1  maximal  partial  clones, 
only  Ha  is  a  non  strong  partial  clone.  Furthermore, 
it  is  shown  in  [8]  (see  also  [3])  that  the  family  of  all 
Slupecki  type  partial  clones  on  A  is  of  continuum  car¬ 
dinality.  Finally,  none  of  the  maximal  partial  clones 
pPol  i?i,  pPol  R2 ,  pPol  t*3,  . . .  ,pPol  Tk  is  finitely  gen¬ 
erated  is  shown  in  [2]  (see  also  [3]). 

The  conditions  to  recognize  finitely  generated  partial 
clones  established  in  [2]  apply  for  partial  clones  C 
whose  “Oa  components”  CDOa  are  either  non  sepa¬ 
rating  or  not  finitely  generated  clones.  Let  C\  ^  C2  be 
two  Slupecki  type  maximal  partial  clones  on  A .  Then 
it  is  easy  to  see  that  the  clone  ( C\  D  C2)  0  Oa  is  a  sep¬ 
arating  finitely  generated  clone,  and  so  the  results  in 
[2]  cannot  be  used  to  decide  whether  the  partial  clone 
Ci  fl  C2  is  finitely  generated.  This  is  solved  in  [7].  We 
say  that  a  partial  clone  Cf  covers  a  partial  clone  C  on 
A  if  C  C  C"  and  [C,  C]  =  {C,C'}.  One  of  the  main 
results  in  [7]  is 

Theorem  15  Let  k  >  2,  A  be  a  k-element  set  and 
C±C*  be  two  strong  maximal  partial  clones  of  Slupecki 
type  on  A .  If  (C,C")  ^  (pPol  Ri,  pPol  R2 ),  then  the 
partial  clone  C  fl  C"  is  covered  by  the  maximal  partial 
clone  Cf .  a 

Theorem  15  is  shown  by  several  steps.  A  main  tool  used 
in  the  proof  is  the  Definability  Lemma  established  in 
[17]  (see  [9]  and,  for  a  weak  version  of  it,  see  [12]). 

The  result  of  Theorem  15  above  does  not  hold  for 
C  =  pPol  Ri  and  C'  =  pPol  R2.  Indeed  it  is  shown 
in  [7]  that  there  is  at  least  one  partial  clone  that 
strictly  lies  in  the  interval  of  partial  clones  [pPol  R\  fl 
pPol  i?2,  pPol  #2]*  Now  even  though  the  size  of  the 
interval  of  partial  clones  [pPol  R\  fl  pPol  R2,  pPol  R2] 
is  not  known,  a  combination  of  Theorem  15  with  the 
fact  that  none  of  the  k  strong  maximal  partial  clones 
of  Slupecki  type  on  A  is  finitely  generated  gives  the 
following 

Theorem  16  Let  k  >  2,  A  be  a  k-element  set  and 
C  ^C'  be  two  strong  maximal  partial  clones  of  Slupecki 
type  on  A.  Then  the  partial  clone  CnC'  is  not  finitely 
generated  on  A.  □ 

The  main  purpose  of  [7]  is  to  study  the  pairwise  in¬ 
tersections  of  Slupecki  type  maximal  partial  clones  on 
A .  Since  f is  one  of  them,  its  intersection  with  all 


other  Slupecki  type  partial  clones  is  studied  in  that  pa¬ 
per.  In  fact  this  question  reduces  to  the  following:  let 
q  e  {i?i,J?2,T3,. . .  Describe  the  interval  of  par¬ 
tial  clones  [Pol  0,  pPol  q]  on  A .  This  type  of  problem 
has  been  tackled  by  the  authors  of  [1],  [24]  and  [25]  for 
A  =  {0, 1},  as  well  as  by  the  first  author  in  [8]. 

One  of  the  results  shown  in  [1]  is  that  the  interval  of 
partial  clones  [Str  (Poli?i),  pPoli?i]  is  of  continuum 
cardinality  on  {0, 1}.  For  l  >  2,  let 
[Str  (Pol  #1),  pPol  Rx]i  := 

{C  |  C  is  a  partial  clone  on  the  ^-element  set  A  and 
Str  (Pol  Ri)  C  C  C  pPol  Rx}. 

A  one-to-one  map 

<j> :  [Str  (Pol  i?i),  pPol  Ri]2  ->  [Str  (Pol  i?i),  pPol  ill]* 
is  constructed  in  [7]  for  every  k  >  3.  As 

[Str  (Pol  i?i),  pPol  R\\  C  [HA  D  Pol  i?i,  pPol  R{\,  we 
obtain 

Proposition  17  Let  k  >  2  and  A  be  a  k-element 
set  Then  the  interval  of  partial  clones  [fl^  fl 
pPol  R\ ,  pPol  ili]  has  the  cardinality  of  continuum  on 
A.  □ 

Using  families  of  relations  defined  in  [13],  the  first  au¬ 
thor  shows  in  [8]  (Theorem  4)  a  result  similar  to  Propo¬ 
sition  17  for  the  relation  R2. 

We  turn  to  the  family  of  relations  {r3, . . . ,  r*}.  For 
n  >  2  and  h  >  3,  let  cr2n+fc  be  the  relation  of  arity 
(2n  +  h)  on  the  set  {1, . . . ,  2n  -h  h}  defined  by 

(xi,...,X2n+/0  €  02n+/i  |{®li*-*i®2n+fc}|  < 

h-  1  or  |{xi, . . .  ,x2n+/i}|  =  h  with  1)  h  -  2  symbols 
occurring  each  once  and  2)  one  symbol  occurring  ex¬ 
actly  twice  and  3)  one  symbol  occurring  2n  times  in 

X\ , . . . ,  X2 n+h  . 

The  inclusions  Ha  H  pPol  Tu  C  pPol  a2n+h  Q  pPol  Th 5 
for  every  n  >  2  and  every  3  <  h  <  k  are  shown  in 
[7].  The  same  paper  gives  a  family  {(j>2m+h  :  m  >  2} 
of  partial  functions  on  A  such  that  for  n,  m  >  2, 
$2m+h  €  pPol  <T2n+/t  <=*  n  7*  ThuS 

Proposition  18  Let  k  >  3,  A  be  a  k-element  set  and 
3  <h<k.  Then  the  interval  of  partial  clones  [fi^  fl 
pPolr/j,  pPol  Th]  has  the  cardinality  of  continuum  on 
A .  □ 

A  combination  of  the  results  above  gives 

Theorem  19  Let  k  >  2,  A  be  a  k-element  set  and 
pPol  q  be  any  maximal  partial  clone  of  Slupecki  type 
on  A.  Then  the  interval  of  partial  clones  [Ha  fl 
pPol  q ,  pPol  g]  has  the  cardinality  of  the  continuum  on 
A.  □ 
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Remark  A  direct  consequence  of  Theorem  19  is  that 
for  \A\  >  3  and  every 

g  £  {i?i}i?2>'r3j***>TJfe}5  the  interval  of  partial  clones 
[Pol  £,  pPol  g]  is  of  continuum  cardinality  on  A.  So 
one  may  ask  the  following  general  problems 

1)  let  g  £  M  be  one  of  the  relations  that  determine  a 
maximal  clone  on  A.  Determine  the  interval  of  partial 
clones  [Pol  g ,  pPol  g]  on  A .  Dually 

2)  let  g  be  a  relation  such  that  pPol  g  is  a  maximal 
partial  clone  on  A.  Determine  the  interval  of  partial 
clones  [Pol  g ,  pPol  g\  on  A. 
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Abstract 

This  paper  compares  the  complexity  of  three  differ¬ 
ent fixed  polarity  generalizations  of  Reed-Muller  canonical 
form  to  multiple-valued  logic:  the  Galois  Field-based  ex¬ 
pansion  introduced  by  Green  and  Taylor,  the  Reed-Muller- 
Fourier  form  of  Stankovic  and  Moraga,  and  the  expansion 
over  addition  modulo  m,  minimum  and  the  set  of  all  lit¬ 
eral  operators  introduced  by  the  author  and  Muzio.  An 
algorithm  for  computing  the  minimal  canonical  forms  for 
these  generalizations  is  implemented  and  applied  to  a  set 
of  encoded  4-valued  benchmark  functions,  3-  and  4-valued 
adders  and  multipliers.  The  experimental  results  show  that, 
for  the  benchmark  functions,  the  Reed-Muller-Fourier form 
and  our  expansion  yield  a  comparable  number  of  prod¬ 
ucts  on  average.  They  have  40%  less  products  on  average 
than  the  expansion  of  Green  and  Taylor.  The  Reed-Muller- 
Fourier  form  gives  a  compact  representation  for  adders, 
while  our  expansion  seems  to  be  suitable  for  multipliers. 


1.  Introduction 

Two-level  expressions  of  multiple-valued  logic  functions 
and  their  minimization  has  been  a  subject  of  active  research 
for  many  years.  This  problem  is  important  because  it  pro¬ 
vides  a  means  for  optimizing  the  implementations  of  cir¬ 
cuits  that  are  direct  translations  of  two-level  expressions 
([l]-[4]).  Thus,  two-level  logic  representations  have  direct 
impact  on  macro-cell  design  styles  using  programmable 
logic  arrays  (PLAs). 

We  can  classify  two-level  expressions  of  multiple- valued 
logic  functions  into  two  major  groups.  The  first  one  in¬ 
cludes  logic  representations  evolved  from  classical  sum-of- 
products  form  of  Boolean  functions  over  a  Boolean  alge¬ 


bra.  An  example  is  the  canonical  form  of  multiple-valued 
functions  in  a  Post  algebra  over  the  set  of  operations  (MIN, 
MAX,  literal)  [5].  The  second  group  includes  logic  repre¬ 
sentations  based  on  generalization  of  modulo  2  polynomial 
over  a  Galois  Field  GF{ 2),  often  referred  to  as  Reed-Muller 
canonical form  [6],  [7].  In  this  paper  we  consider  three  dif¬ 
ferent  generalizations  of  the  Reed-Muller  canonical  form  to 
multiple-valued  logic  and  compare  their  complexity. 

The  Reed-Muller  canonical  form  can  be  extended  to 
multiple-valued  logic  in  several  ways,  depending  on  how 
its  operations  are  generalized.  Many  extensions  have  been 
suggested,  including  [8]-[13].  In  these  extensions,  AND 
and  XOR  operations  (which  are  equivalent  to  multiplica¬ 
tion  and  addition  modulo  2,  correspondently)  are  general¬ 
ized  to  addition  and  multiplication  in  GF(m).  This  is  a 
natural  extension,  allowing  to  preserve  the  nice  properties 
of  fields.  However,  such  generalizations  are  only  applica¬ 
ble  to  algebras  with  m  being  a  prime  or  a  power  of  a  prime 
number.  This  doesn’t  cover,  for  example,  the  interesting 
case  of  m  =  10.  In  [14]  we  introduced  another  general¬ 
ization  of  the  fixed  polarity  Reed-Muller  canonical  form, 
where  AND  is  extended  to  MIN  operation,  and  XOR  is  ex¬ 
tended  to  addition  modulo  m.  Since  MIN  is  not  distributive 
over  addition  modulo  m,  such  an  algebraic  structure  is  not 
a  field  any  longer,  but  it  is  applicable  to  any  positive  integer 
m. 

From  an  implementation  point  of  view,  the  MIN  op¬ 
eration  is  easier  to  implement  than  the  multiplication  in 
GF(m).  For  example,  in  current-mode  CMOS  technol¬ 
ogy,  MIN  can  be  implemented  using  5  transistors  only  [15]. 
Moreover,  this  implementation  is  independent  of  m,  i.e.  the 
number  of  transistors  does  not  increase  with  increasing  val¬ 
ues  of  m.  On  the  other  hand,  the  implementation  of  multi¬ 
plication  in  GF(m)  is  larger  and  it  always  depends  on  m. 
For  example,  for  m  =  3,  a  current-mode  CMOS  circuit 
realizing  multiplication  modulo  3  consists  of  16  transistors 
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[16]. 

However,  this  implementation  advantage  can  only  be  uti¬ 
lized  if  the  complexity  of  MIN-based  expression  of  an  ra- 
valued  function  is  not  larger  than  the  complexity  of  the 
GF(m)  multiplication-based  expression.  The  purpose  of 
this  paper  is  to  compare  the  complexity  of  our  canonical 
form  to  the  complexity  of  the  canonical  forms  introduced 
in  [1 1]  and  [13].  As  a  measure  of  complexity  we  use  the 
number  of  products  and  the  number  of  literals  per  term  in 
the  expression.  This  choice  is  motivated  by  the  major  op¬ 
timization  objectives  in  combinational  logic  design,  which 
are  area  and  delay  reduction  [17].  When  considering  a  two- 
level  implementation  of  a  sum-of-products-type  expression 
by  a  PLA,  each  row  of  a  PLA  is  in  one-to-one  correspon¬ 
dence  with  a  product-term  of  the  sum-of-products  expres¬ 
sion.  Each  transistor  in  a  row  relates  to  the  literals  of  a 
product-term.  Therefore,  the  primary  goal  of  logic  opti¬ 
mization  is  the  reduction  of  the  number  of  products,  and 
a  secondary  one  is  the  reduction  of  the  number  of  literals. 
Achieving  a  sum-of-products  representation  with  the  min¬ 
imal  number  of  products  and  with  the  minimal  number  of 
literals  corresponds  to  optimizing  both  area  and  delay.  So, 
these  two  parameters  are  good  measures  of  the  complex¬ 
ity  of  a  two-level  expression.  For  our  purpose,  a  minimal 
expression  is  an  expression  with  the  smallest  number  of  lit¬ 
erals  taken  from  the  expressions  with  a  smallest  number  of 
products. 

Using  the  theory  developed  in  [12],  [13]  and  [14]  we  im¬ 
plemented  an  algorithm  for  computing  the  minimal  canon¬ 
ical  forms  for  the  three  different  generalizations  of  Reed- 
Muller  canonical  form:  the  Galois  Field-based  expansion 
introduced  by  Green  and  Taylor  [11],  the  Reed-Muller- 
Fourier  form  of  Stankovic  and  Moraga  [13]  and  our  gen¬ 
eralization  from  [14].  We  applied  the  algorithm  to  a  set 
of  encoded  4-valued  benchmark  functions,  3-  and  4-valued 
adders  and  multipliers.  The  experimental  results  show  that 
for  the  benchmark  functions  and  multipliers  Reed-Muller- 
Fourier  form  and  our  form  have  a  comparable  number  of 
products  on  average.  They  have  40%  less  products  on  aver¬ 
age  than  the  expansion  of  Green  and  Taylor.  Reed-Muller- 
Fourier  form  gives  best  results  for  adders,  contrary  to  our 
form  for  which  the  adders  seem  to  be  the  worst-case.  How¬ 
ever,  our  expansion  seems  to  be  suitable  for  multipliers. 

The  paper  is  organized  as  follows.  In  Section  2,  the  back¬ 
ground  on  Reed-Muller  canonical  form  and  its  generaliza¬ 
tions  is  given.  In  Section  3,  the  algorithm  for  computing 
the  minimal  canonical  forms  from  [11],  [13]  and  [14]  is  de¬ 
scribed.  In  Section  4,  the  experimental  results  are  shown.  In 
the  final  section,  some  conclusions  are  drawn  and  directions 
for  further  research  are  proposed. 


2.  Reed-Muller  canonical  form  and  its  general¬ 
izations 

In  this  section  we  describe  Reed-Muller  canonical  form 
of  Boolean  functions  and  its  generalizations  to  multiple¬ 
valued  logic. 

Reed-Muller  canonical  form  of  Boolean  functions  was 
introduced  in  1954  by  Reed  [6]  and  Muller  [7].  It  is  an 
expression  of  type: 

2n  —1 

/(*!,. ..,X„)  =  ^  "X2  (1) 

i- 0 

where  c*  E  {0, 1}  are  constants,  the  sign  £  stands  for  XOR 
and  stands  for  AND.  (iii2  . . .  in)  is  the  binary  expansion 
of  i  with  ii  being  the  least  significant  digit,  and  Xj  =  1  and 
x]  =  Xj  for  j  E  {0, 1, ... ,  n}.  All  variables  in  (1)  are 
uncomplemented. 

If  the  restriction  on  all  the  variables  being  uncom¬ 
plemented  is  dropped,  then  the  Reed-Muller  canonical 
form  extends  to  fixed  polarity  Reed-Muller  canonical  form, 
which  is  unique  for  a  fixed  polarity  k  E  {0, 1, . . . ,  2n  -  1} 
and  is  given  by: 

2n  —  1 

Hx =  x;  c( .  • . . .  •  (2) 

i= 0 

where  and  {k\k2-..kn)  are  the  binary  ex¬ 

pansions  of  i  and  k,  respectively.  The  term  kjxl?  ,j  E 
{0,1,..., n},  is  defined  by:  =  Xj9  1x]  —  Xj  and, 

kjxV  =  1,  for  kj  E  {0,1}. 

There  are  several  ways  to  extend  the  Reed-Muller  canon¬ 
ical  form  to  m- valued  logic.  The  first  generalization  was 
proposed  by  Cohn  in  1960  [8].  It  is  an  expression  over  a 
ring  of  integers  modulo  m,  with  m  being  a  prime,  namely: 

mn— 1 

f[x  1, . . . ,  Xn)  =  y2  Ci  ©  ©  *2  ©  •  •  •  ©  Xn  .  (3) 

t=0 

j/ 

where  c*  are  constants  over  a  finite  set  of  values  M  — 
{0, 1, . . . ,  m- 1},  the  sign  £  stands  for  addition  modulo  m, 
and  ”0”  stands  for  multiplication  modulo  m.  (hh  . . .  in) 
is  the  m-ary  expansion  of  i,  and  the  term  xlj  denotes  the 
ijth  power  of  the  variable  Xj,j  E  {0, 1, ,  n}. 

Cohn’s  generalization  was  extended  by  Pradhan  [9]  for 
the  case  when  m  is  a  power  of  a  prime,  and  then  further  ex¬ 
tended  by  Kodandapani  and  Setlur  [10]  to  the  fixed-polarity 
case,  where  the  variables  Xj,  j  E  {1, 2, . . . ,  n},  are  repre- 

k  • 

sented  by  all  literals  except  xj,  where  (fcn  . . .  k^k  1)  is  the 
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m-ary  expansion  of  fixed  a  polarity  he  {0, 1 , . . .  ?  mn  - 1 } . 
Recall,  that  the  literal  operator  x,  i  e  M,  is  defined  by 

1  df  j  m  —  1  if  x  —  i 
~  \  0  otherwise 

Green  and  Taylor  [11]  introduced  a  different  extension 
of  Cohn’s  form  to  the  fixed-polarity  case,  where  an  ad¬ 
ditive  transform  xj  +  kj  is  performed  on  each  variable 
Xj,  j  €  {l,2,...,n},  according  to  a  fixed  polarity  k  e 
{0, 1, . . . ,  mn  -  1}.  Harking  and  Moraga  [12]  presented 
an  efficient  algorithm  for  computing  such  an  expansion  for 
different  polarities. 

Stankovic  and  Moraga  [  1 3]  introduced  yet  another  exten¬ 
sion,  called  Reed-Muller-Fourier form,  where  the  restriction 
on  all  the  variables  being  uncomplemented  is  dropped,  and 
the  NOT  operation  is  generalized  to  be  any  reordering  of  the 
elements  of  GF(m ),  giving  ( m\)n  different  fixed  polarities. 

All  of  the  above  described  generalizations  are  only  ap¬ 
plicable  for  the  algebras  with  m  being  a  prime  or  a  power 
of  a  prime  number.  In  [14]  we  introduced  another  gener¬ 
alization  of  the  fixed  polarity  Reed-Muller  canonical  form, 
applicable  to  any  positive  integer  m.  We  showed  that  any 
m-valued  n-variable  function  can  be  expressed  over  the  set 
of  operations  addition  modulo  m,  MIN  and  the  set  of  all 

literal  operators  x,  i  €  M,  in  the  following  fixed-polarity 
canonical  form: 

mn  — 1 

/(*!,...,*»)=  E  ci-  ‘‘41-  *■*£  •••••  kn<n  (4) 

i=0 


where  the  sign  Yl  stands  for  addition  modulo  m ,  and 
stands  for  MIN,  and  the  term  kj  xj  is  defined  by 


kj 


i,.  g(  m-1  ifkj=0 

j  |  k,xt>j  otherwise 


where  Xj,  j  €  {1, 2, ...  ,n},  is  a  multiple-valued  variable 
and  kj,ij  £  M  are  constants. 

In  our  generalization  of  the  polarity,  we  assume  that  each 
variable  Xj,  j  6  {1, 2, . . . ,  n},  is  represented  by  all  literals 

k  • 

except  Xj,  where  (kn  . . .  fofci)  is  the  m-ary  expansion  of  a 
polarity  k  6  {0, 1, . . . ,  to”  -  1}.  For  example,  if  to  =  3, 

polarity  vector  k  =  (021)  implies  that  in  the  canonical  form 

0  2 

xi  is  represented  by  literals  xx  and  xx ,  x2  is  represented  by 

x2  and  x2,  and  X3  is  represented  by  x3  and  x3.  A  similar 
generalization  of  polarity  is  used  in  [10]. 

We  select  three  fixed  polarity  generalizations  of  Reed- 
Muller  canonical  form:  the  form  introduced  by  Green  and 
Taylor  [1 1],  the  Reed-Muller-Fourier  form  of  Stankovic  and 
Moraga  [13]  and  our  generalization  from  [14],  and  compare 
their  complexity.  In  the  next  section  we  describe  the  algo¬ 
rithm  which  we  use  for  computing  these  forms. 


3.  Computation  of  the  minimal  canonical 
forms 

In  this  section  we  present  an  algorithm  for  computing  the 
minimal  canonical  forms  for  the  generalizations  introduced 
in  [1 1],  [13]  and  [14].  The  algorithm  is  based  on  the  theory 
developed  in  [12],  [13],  [14],  and,  apart  from  unifying  the 
previous  results,  brings  no  novelty.  The  reason  for  reporting 
it  here  is  merely  to  give  the  reader  a  clear  picture  of  how 
the  experiment  was  conducted,  so  that  it  can  be  repeated,  if 
needed. 

The  pseudocode  of  the  algorithm  is  shown  in  Figure  2. 
The  same  structure  is  used  for  computing  all  three  canonical 
forms,  with  the  difference  in  the  transformation  matrices 
and  the  operations  used  for  the  computations.  For  example, 
the  transformation  matrices  for  n  =  1  and  m  =  4  are  shown 
in  Figure  1. 

Green  &  Taylor  Stankovic  &  Moraga  Dubrova  &  Muzio 


'100 

1  ‘ 

'  3  0  0  0  ‘ 

'  1  0  0  0  ' 

0  1  1 

1 

3  10  0 

3  10  0 

0  3  2 

1 

3  2  3  0 

3  0  10 

0  2  3 

1 

3  3  11 

3  0  0  1 

Figure  1.  The  transformation  matrices  for 

n  =  1  and  m  =  4- 


The  algorithm  receives  as  an  input  a  cube  cover  of  the 
on-set  of  a  completely  specified  multiple- valued  multiple- 
output  function  /.  It  returns  as  an  output  the  number  of 
non-zero  coefficients  in  a  minimal  canonical  form  of  /.  The 
number  of  non-zero  coefficients  corresponds  to  the  number 
of  products  in  the  expression.  Recall,  that  we  define  mini¬ 
mal  expression  as  the  expression  with  the  smallest  number 
of  literals  among  the  expressions  with  a  smallest  number  of 
products.  An  optimal  polarity  is  the  polarity  of  a  minimal 
expression. 

First,  the  transformation  matrix  TM(n)  is  generated. 
TM(n )  is  a  two-dimensional  array  of  integers  of  size 
7nn  x  7nn,  where  m  is  the  radix  of  logic  and  n  is  the  number 
of  inputs. 

The  arrays  P  and  R  are  two-dimensional  arrays  of  inte¬ 
gers  of  size  mn  x  k,  where  k  is  the  number  of  outputs  of  /. 
The  polarity  vector  P  is  computed  for  a  given  polarity  p  by 
reordering  the  coefficients  of  the  functions  in  a  certain  way, 
depending  of  the  value  of  p  (for  more  details  the  reader  is 
referred  to  [12],  [13]  and  [14]).  R  is  the  resulting  vector  of 
coefficients  of  the  canonical  form.  It  is  obtained  by  multi¬ 
plying  the  transformation  matrix  TM  by  the  polarity  vector 
P.  For  Reed-Muller-Fourier  form  [13]  and  our  form  [14], 
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the  computation  is  carried  out  modulo  m.  For  the  Green  and 
Taylor  form  [1 1],  the  operations  of  multiplication  and  addi¬ 
tion  over  GF(m)  are  used.  For  a  prime  m  these  coincide 
with  modulo  m  operations.  For  the  extended  fields,  when 
m  is  a  power  of  a  prime,  they  differ. 

Notice,  that  the  algorithm  processes  multiple-output 
functions  by  first  fixing  the  polarity,  then  computing  the 
non-zero  coefficients  of  the  expressions  for  all  outputs,  and 
then  evaluating  the  total  number  of  products.  We  keep  track 
of  the  products,  common  for  several  outputs,  are  count  them 
just  once.  Another  possible  strategy  would  be  to  find  an  op¬ 
timal  polarity  for  each  output  function  separately. 


GeneralizedRM(F) 
input:  on-set  F  of  f 

output:  number  of  non-zero  coefficients  in  a  minimal 
canonical  form  of  / 

TM(n) «-  Generate(n); 
bestjproducts  -  oo; 
best-literals  =  oo; 
bestjpolarity  =  0; 
for  (all  polarities  p)  { 

make  the  polarity  vector  P  by  reordering  the 
coefficients  of  f; 

R  =  TM(n)  -  P; 
products  4—  CountProducts(-R); 
literals  4-  CountLiterals(iZ); 
if  ( products  <  best  products)  { 
if  ( literals  <  best-literals)  { 
best-products  =  products; 
bestJiterals  =  literals; 
best  polarity  =  p; 

} 

} 

} 

re-compute  R  and  P  for  best  .polarity; 

TM~l(n)  4-  GenerateReverse(n); 
if  (TM-l(n)-R£P) 

return( 

else 

return  (best  .products); 

Figure  2.  Pseudocode  of  the  algorithm  for 
computing  minimal  canonical  forms. 


After  the  optimal  polarity  is  found,  the  solution  is  veri¬ 
fied  by  performing  the  reverse  transform  TM~l  ( n )  •  R. 


4.  Experimental  results 

We  implemented  the  algorithm  shown  in  Figure  2  and 
applied  it  to  a  set  of  4- valued  benchmark  functions,  and  3- 
and  4-valued  adders  and  multipliers.  The  m-valued  adders 
and  multipliers  were  obtained  by  a  direct  encoding  of  their 
functionality  in  m-valued  logic.  The  4- valued  benchmark 
functions1  were  generated  by  pairing  inputs  and  outputs  of 
benchmarks  and  encoding  them  using  the  scheme  shown  in 
Figure  3.  If  a  function  has  an  odd  number  of  inputs  (or  out¬ 
puts),  then  we  assume  the  existence  of  one  additional  input 
(or  output)  which  has  the  don’t  care  value  for  all  combi¬ 
nations.  Since  our  algorithm  takes  as  an  input  completely 
specified  functions  only,  we  first  run  Espresso  [18]  to  ob¬ 
tain  covers  of  the  original  benchmarks  and  then  apply  the 
encoding  to  the  resulting  covers.  However,  a  similar  strat¬ 
egy  can  be  used  to  obtain  a  set  of  incompletely  specified 
4-valued  functions.  The  fragments  of  Espresso  format  of  a 
Boolean  function  and  the  resulting  4-valued  one  are  shown 
in  Figure  4.  Notice,  that  if  a  single  value  in  a  pair  of  binary 
values  is  don’t  care,  then  this  pair  is  encoded  in  two  differ¬ 
ent  ways.  A  binary  cube  with  r  such  pairs  will  be  encoded 
in  2r  4-valued  cubes. 


pair  of  values  in 
the  original  function 

resulting  value(s)  in 
the  4-valued  function 

00 

0 

01 

1 

10 

2 

11 

3 

-0 

{0,2} 

-1 

{1,3} 

0- 

{0,1} 

1- 

{2,3} 

Figure  3.  Encoding  scheme  used  to  obtain  4- 
valued  input  4-valued  output  benchmarks; 
stands  for  the  don’t  care  value. 


Table  1  shows  the  number  of  non-zero  coefficients  in 
the  expressions  computed  by  the  algorithm  in  Figure  2  for 
the  4-valued  benchmark  functions.  Columns  2  and  3  give 
the  number  of  inputs  n  and  the  outputs  k  of  the  functions. 
The  numbers  of  non-zero  coefficients  in  the  canonical  forms 
[1 1],  [13]  and  [14]  are  shown  in  the  columns  4, 6  and  8,  cor- 
respondently.  To  have  a  more  representative  comparison, 
we  have  also  presented  in  columns  5,  7  and  9  the  relative 
values  in  %,  taking  the  Green  &  Taylor  [11]  form  as  ba- 

1This  set  of  4- valued  benchmarks  is  available  at 
http://www.ele.kth.se/~elena/ 
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Table  1.  Benchmark  results  for  4-valued  functions,  showing  the  number  of  non-zero  coefficients  in 
the  expressions  computed  by  the  algorithm  in  Figure  2. 


Example 

function 

n 

k 

Green  & 
Taylor  [11] 

Stankovic  & 
Moraga  [13] 

abs. 

% 

5xpl 

3 

5 

223 

100% 

154 

150 

67.26% 

9sym 

5 

1 

261 

100% 

84 

32.18% 

135 

51.72% 

bw 

3 

14 

118 

100% 

79 

66.95% 

96 

81.36% 

clip 

5 

3 

1112 

100% 

673 

715 

con 

4 

1 

57 

100% 

42 

73.68% 

23 

40.35% 

5 

5 

2249 

100% 

2231 

2238 

99.51% 

inc 

4 

5 

193 

100% 

123 

63.73% 

120 

62.18% 

misexl 

4 

D 

99 

100% 

51 

51.52% 

28 

28.28% 

rd53 

3 

El 

100% 

25 

rd73 

□ 

H 

189 

100% 

63.49% 

rd84 

H 

n 

262 

100% 

125 

47.71% 

116 

44.27% 

sao2 

5 

H 

657 

100% 

246 

89 

13.55% 

squar5 

3 

D 

61 

100% 

38 

47 

average 

T 

425 

100% 

306 

59.26% 

301 

57.95% 

Table  2.  Experimental  results  for  3-  and  4-valued  adders,  showing  the  number  of  non-zero  coefficients 
in  the  expressions  computed  by  the  algorithm  in  Figure  2. 


m 

Example 

function 

Green  & 
Taylor  [11] 

Stankovic  & 
Moraga  [13] 

Dubrova  & 
Muzio  [14] 

abs. 

% 

abs. 

% 

abs. 

% 

3 

adder  2-bit 

Q 

MM 

100% 

14 

66.67% 

152.38% 

adder  3-bit 

H 

Ea 

100% 

42 

56.00% 

Bral 

220.00% 

4 

adder  2-bit 

4 

3 

77 

100% 

34 

44.16% 

88.31% 

adder  3-bit 

6 

4 

415 

100% 

174 

41.93% 

Bll 

126.27% 

average 

5 

4 

147 

100% 

66 

52.19% 

197 

146.74% 

Table  3.  Experimental  results  for  3-  and  4-valued  multipliers,  showing  the  number  of  non-zero  coeffi¬ 
cients  in  the  expressions  computed  by  the  algorithm  in  Figure  2. 


Green  & 

Stankovic  & 

Dubrova  & 

Example 

Taylor  [11] 

Moraga  [13] 

Muzio  [14] 

m 

function 

n 

k 

abs. 

% 

abs. 

% 

abs. 

% 

3 

mult.  2-bit 

4 

4 

52 

100% 

33 

63.46% 

31 

59.62% 

mult.  3-bit 

6 

6 

581 

100% 

369 

63.51% 

349 

60.07% 

4 

mult.  2-bit 

Q 

4 

278 

lESgl 

113 

40.36% 

108 

38.85% 

mult.  3-bit 

H 

6 

5813 

2830 

48.68% 

2171 

37.35% 

average 

H 

01 

1681 

100% 

836 

54.08% 

665 

48.97% 

96 
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.o5 

,m4 

110000-10 

1000100001 

30010 
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30030 
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30011 

20201 

30031 

20201 

10-10011 

0000100100 

2-212 

00210 

2-213 

00210 

.e 

.e 

Figure  4.  Fragment  of  Espresso  format  of  an 
original  benchmark  function  and  the  encoded 
4-valued  one. 


sis  (100%).  Tables  2  and  3  show  the  results  for  the  3-  and 
4-valued  adders  and  multipliers,  correspondently. 

The  experimental  results  show  that  for  the  benchmark 
functions  Reed-Muller-Fourier  form  and  our  expansion  are 
comparable  in  the  number  of  products  on  average.  The  ex¬ 
pansion  of  Green  and  Taylor  has  40%  more  products  on 
average.  Reed-Muller-Fourier  form  gives  best  results  for 
adders,  contrary  to  our  form  for  which  the  adders  are  hard. 
Our  form  seems  particularity  suitable  for  multipliers,  with 
5%  reduction  of  the  number  of  products  over  Reed-Muller- 
Fourier  form  and  51%  reduction  over  Green  and  Taylor’s 
form. 

For  the  Green  and  Taylor’s  and  our  expansions,  the  op¬ 
timal  polarity  is  computed  by  an  exhaustive  search  through 
all  mn  possible  polarities.  This  brute  force  approach  makes 
the  algorithm  infeasible  for  the  functions  with  n  >  6  and 
m  >  4.  E.g.,  it  takes  only  4  min  to  compute  the  optimal 
polarity  for  the  3-valued  3-bit  adder  having  6  inputs.  How¬ 
ever,  2  days  are  needed  to  find  the  solution  for  the  3-valued 
4-bit  adder  having  8  inputs.  Using  some  heuristics  for  find¬ 
ing  an  optimal  polarity  would  extend  the  capability  of  the 
algorithm,  but  so  far  no  such  heuristics  have  been  found. 

For  Reed-Muller-Fourier  form  [13],  the  exhaustive 
search  through  all  (m!)n  possible  polarities  is  out  of  ques¬ 
tion  even  for  n  >  2.  Therefore,  we  compute  mn  represen¬ 
tatives  of  (m!)n  possible  polarities  and  take  the  best  result. 
This,  of  course,  means  that  we  do  not  exploit  the  full  power 
of  Reed-Muller-Fourier  form  and  for  some  functions  the  re¬ 
sult  we  compute  is  not  the  best  one. 

When  analyzing  the  results  in  the  above  tables,  it  should 
be  taken  into  account  that  the  products  in  the  Reed-Muller- 
Fourier  form  and  the  form  [1 1]  use  GF(m)  multiplication, 
while  the  products  in  our  form  [14]  use  MIN.  With  present 
technologies,  MIN  has  a  much  simpler  implementation  than 


multiplication  in  GF(m ).  So,  the  circuit  realizing  a  MIN- 
based  expression  will  occupy  less  area  than  the  circuit  real¬ 
izing  a  GF(m)  multiplication-based  expression  of  the  same 
number  of  products  and  literals.  This,  of  course,  might 
change  as  technology  evolves. 

5.  Conclusion 

This  paper  compares  the  complexity  of  three  fixed  po¬ 
larity  generalizations  of  Reed-Muller  canonical  form:  the 
Galois  Field-based  form  introduced  by  Green  and  Taylor 
[1 1],  the  Reed-Muller-Fourier  form  of  Stankovic  and  Mor- 
aga  [13]  and  our  generalization  over  addition  modulo  m, 
minimum  and  the  set  of  all  literal  operators  [14].  Using 
the  theory  developed  [12],  [13]  and  [14]  we  implemented 
an  algorithm  for  computing  the  minimal  canonical  forms 
for  the  corresponding  generalizations  and  applied  to  a  set 
of  benchmark  functions,  adders  and  multipliers.  The  exper¬ 
imental  results  show  that  for  the  benchmark  functions  the 
Reed-Muller-Fourier  form  and  our  expansion  give  compa¬ 
rable  results.  On  average,  the  number  of  products  in  these 
expansions  is  40%  smaller  than  the  number  of  products  in 
the  expansion  of  Green  and  Taylor.  Adders  have  most  com¬ 
pact  representations  in  Reed-Muller-Fourier  form.  Our  ex¬ 
pansion  seems  to  be  most  suitable  for  multipliers. 

In  general,  it  is  hard  to  make  a  comparison  between  dif¬ 
ferent  representations  of  functions.  We  based  our  compari¬ 
son  on  the  benchmarks  results.  An  alternative  would  be  to 
compare  the  upper  bounds  on  the  number  of  products  in  the 
canonical  forms.  Unfortunately,  very  few  results  on  the  up¬ 
per  bounds  are  reported  for  the  case  of  m  >  2  and  none  for 
the  canonical  forms  introduced  in  [1 1],  [13]  and  [14]. 

The  main  open  problem  remains  finding  a  good  heuris¬ 
tic  for  selecting  an  optimal  polarity.  We  need  to  examine 
which  functional  properties  can  be  used  to  guide  the  choice 
of  polarity  and  also  to  investigate  which  search  techniques 
are  applicable  to  this  problem. 

Further  work  on  the  efficiency  of  computing  the  ex¬ 
pressions  might  also  incorporate  representation  of  multiple¬ 
valued  functions  by  Multiple- Valued  Decision  Diagrams 
(MDD)  [19],  and  performing  the  basic  operations  of  the  al¬ 
gorithm  directly  on  graphs.  Since  there  is  no  direct  corre¬ 
spondence  between  the  size  of  the  cover  for  the  function 
and  the  size  of  the  MDD  that  represents  it,  very  large  cube 
covers  can  be  captured  and  efficiently  manipulated  using 
this  representation.  For  the  Reed-Muller-Fourier  represen¬ 
tation,  some  work  in  this  direction  for  has  been  started  in 
[20]  and  [21]. 
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Abstract 

This  paper  considers  an  optimization  method  of  programmable 
logic  arrays  (PLAs),  which  have  two-input  EXOR  gate  at  the  out¬ 
puts.  The  PLA  realizes  an  EXOR  of  two  sum-of-products  expres¬ 
sions  (EX-SOP)  for  multiple-valued  input  two-valued  output 
functions.  We  present  techniques  to  minimize  EX-SOPs ,  which 
is  an  extension  ofDubrova-Miller-Muzio's  AOXMIN  algorithm. 
We  conjecture  that ,  when  n  is  sufficiently  large ,  an  EX-SOP  for 
n-bit  adder  requires  at  most  2”  products  while  an  ordinary  sum- 
of-products  expression  (SOP)  requires  6  •  2”  -  4 n  —  5  products. 
Experimental  results  for  two-  and  four-valued  benchmark  func¬ 
tions  show  that  the  proposed  method  produces  better  EX-SOPs 
than  existing  methods. 

Index  Terms — Three-level  network :,  logic  minimization ,  adder , 
multiple-valued  logic ,  programmable  logic  array. 

1  Introduction 

Programmable  logic  arrays  (PLAs)  with  two-input  EXOR 
gate  at  the  outputs,  also  known  as  AND-OR-EXOR  PLAs 
(Fig.  1),  are  a  powerful  architecture  to  realize  many  logic 
functions.  The  AND-OR-EXOR  PLA  realizes  an  EXOR  of 
two  sum-of-products  expressions  (EX-SOP).  Minimization 
of  the  number  of  products  in  EX-SOPs  is  an  important  step 
in  the  optimization  of  AND-OR-EXOR  PLAs,  because  the 
number  of  products  is  directly  related  to  the  cost  of  PLAs. 
EX-SOPs  are  promising  because,  for  many  practical  logic 
functions,  they  often  require  many  fewer  products  than 
sum-of-products  expressions  (SOPs)  [4,6, 13]. 

Minimization  of  EX-SOPs  were  considered  in  the  past  [8, 
14]  and  a  cut-and-try  method  was  reported  [9].  Design 
methods  for  adders  by  using  AND-OR-EXOR  PLAs  with 
more  than  one-bit  decoders  were  developed  at  IBM  [15].  In 
the  last  few  years  significant  progress  in  the  minimization 
of  EX-SOPs  have  been  made  [4, 6, 13].  Upper  bounds  on  the 
number  of  products  in  EX-SOPs  are  reported  [2, 3, 5] .  AND- 
OR-EXOR  network  where  the  output  EXOR  gate  have  un¬ 
limited  fan-in  is  considered  [12]. 

In  this  paper,  we  present  a  heuristic  method  to  minimize 
EX-SOPs,  which  is  an  extension  of  AOXMIN  [6].  Unlike 


AOXMIN,  we  can  minimize  EX-SOPs  for  multiple-valued 
input  two-valued  output  functions.  EX-SOPs  for  functions 
with  two-  and  four-valued  inputs  correspond  to  AND-OR- 
EXOR  PLAs  with  one-  and  two-bit  decoders,  respectively 
(Fig.  1)  [13].  We  also  present  a  method  to  further  reduce 
the  number  of  products  in  EX-SOPs  by  considering  out¬ 
put  phase  optimization  [11],  where  some  components  of  the 
function  are  implemented  in  the  complemented  form. 

A  crucial  step  in  AOXMIN  is  to  partition  the  products  of 
an  SOP  of  the  given  function  into  two  sets,  which  is  done  by 
a  random  method.  We  propose  a  partitioning  method  for 
adders.  Our  experimental  result  demonstrates  that,  for  an 
n-bit  adder  with  two-valued  inputs  and  sufficiently  large  n, 
the  proposed  partitioning  method  is  about  250  times  faster 
to  produce  about  two  times  better  solution  than  the  random 
partitioning  method.  For  adders  with  two-bit  decoders, 
proposed  partitioning  method  is  faster  than  random  par¬ 
titioning  method  in  producing  comparable  solutions. 

The  remainder  of  the  paper  is  organized  as  follows:  Sec¬ 
tion  2  reviews  terminologies.  Section  3  considers  out¬ 
put  phase  optimization  techniques.  Section  4  summarizes 
AOXMIN  and  describes  its  extensions.  Section  5  presents 
design  method  for  adders.  Section  6  shows  experimental 
results  and  conjectures  that,  when  n  is  sufficiently  large, 
an  EX-SOP  for  an  n-bit  adder  requires  at  most  2”  products. 
Section  7  presents  conclusions. 

2  Definitions  and  Terminologies 

In  this  section,  we  review  basic  terminologies  related  to 
multiple- valued  functions  [10,11]. 

Definition  2.1  A  multiple-valued  input  two-valued  output 
function ,  or  function  in  short ,  is  a  mapping 

f{X\,X2, . .  .,X«) :  X  Pi  ->  B, 

i=l 

where  Pi  =  {0,1,. . -  1),  pi  >2 ,  B  —  {0,1},  and  X,-  is  a 
multiple-valued  variable  taking  a  value  from  P,. 

Definition  2.2  Let  Si  C  P,.  A  literal  Xp  represents  0  if  X/  £  S, 
and  1  if  X,*  6  S,*.  A  product  Xp  Xp  *  *  •  Xp  is  AND  of  literals. 
A  cube  is  a  convenient  representation  of  a  product  for  computer 
manipulation. 
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Figure  1:  AND-OR-EXOR  PLA  with  (a)  one-bit  and 
(b)  two-bit  decoders. 


Definition  2.3  A  sum-of-products  expression  (SOP) 

V 

(Si,S2,...,S„) 

is  OR  of  products.  An  SOP  is  represented  by  a  cover ,  which  is  a 
set  of  cubes.  An  EX-SOP 

V  Xf‘X|2---Xf"  ®  V  —  (2.1) 

is  the  EXOR  of  two  SOPs. 

Definition  2.4  Let  f(X i,X2, . . .  ,X„)  (i  =  0,1 , . . .  ,m  —  1)  be 
an  n-input  m-output  function.  Then ,  the  two-valued  output 
function  F(Xi,X2/.  .  .,Xn,X„+i),  where  X„+1  is  an  m-valued 
variable  representing  the  outputs  such  that  F(Xi,X2,. .  *,Xn,i) 
=  fi(X 1,X2,.../X„),  is  the  characteristic  function  for  the 
multiple-output  function  [11]. 

Definition  2.5  An  SOP  for  a  multiple-output  function  in¬ 
dicates  an  SOP  for  its  characteristic  function ,  and  an  EX-SOP 
for  a  multiple-output  function  indicates  an  EX-SOP  for  its 
characteristic  function. 

Definition  2.6  The  intersection  of  the  products  C\  =  Xf’Xf2 
.  -  •  x%”  and  c2  =  Xj*  Xj2  •  •  *  Xj",  denoted  by  C\  n  c2,  is  the  prod¬ 
uct  X*nT’Xf2nT2  •  •  •  Xsn«"T*.  If  Sf  D  Tf  =  0  for  some  i,  then  the 
intersection  denotes  a  null  cube. 

Definition  2.7  Disjoint  sharp  of  two  covers  F  and  G,  denoted 
by  F  (g)  G,  represents  only  those  minterms  of  F  which  are  not 
contained  by  G. 

Definition  2.8  ON-set,  OFF-set,  and  DC-set  is  the  set  of 
cubes  for  which  the  function  value  is  1,  0,  and  unspecified ,  re¬ 
spectively. 

In  this  paper,  we  often  use  the  same  symbol  for  a  function 
and  its  cover;  and  unless  otherwise  specified,  adder  refers  to 
adder  without  carry  input ,  and  a  dm  represents  an  n-bit  adder. 

3  Output  Phase  Optimization 

In  many  cases,  we  can  realize  a  function  /  in  either  positive 
phase  (/)  or  negative  phase  (/).  For  m-output  function,  we 
can  choose  the  output  phases  in  2m  ways.  The  choice  of 
the  output  phases  in  the  realization  of  a  function  influences 
on  the  number  of  products  in  its  minimized  expressions. 
To  reduce  the  number  of  products  by  choosing  the  output 
phases  is  output  phase  optimization  [11]. 


Definition  3.1  Let  (Jo,fi,-  •  -,/m-i)  be  an  m-output  function. 
Then ,  the  minimized  SOP  G  for  the  characteristic  function  of 
(go,gi, ■  1)/  where  &  €  {/-,/;}  (»  =  0,1, 1)  such 

that  the  number  of  products  in  G  is  minimal  is  the  output  phase 
optimized  SOP  for  (fo,fi,  • .  .,/m- 1)- 

Similarly,  we  can  define  an  output  phase  optimized 
EX-SOP. 

We  handle  the  output  phase  optimization  of  EX-SOPs  by 
using  the  output  phase  optimization  of  SOPs.  We  use  an 
output  phase  optimized  SOP  as  the  input  of  the  EX-SOP 
minimizer.  For  a  function  with  m  outputs,  an  EX-SOP  min- 
imizer  produces  two  SOPs  each  having  m  outputs.  We  op¬ 
timize  the  output  phases  of  the  2m-output  SOP  to  obtain  an 
output  phase  optimized  EX-SOP. 

Let  the  output  phase  for  the  function  /  be  ax  €  {0,1}, 
where  ax  =  0  indicates  /,  is  in  the  positive  phase  and  a{  =  1 
indicates  /,  is  in  the  negative  phase.  Let  the  output  phases 
of  the  two  SOPs  of  the  EX-SOP  for  /■  be  bjo  and  bn.  There¬ 
fore,  the  output  phase  of  the  EX-SOP  for  f\  is  a,  0  fc,o  ©  bx j. 
When  output  phase  optimization  of  the  two  m-output  SOPs 
is  impractical,  we  consider  a,  as  the  output  phase  of  the 
EX-SOP  for  ft. 

An  output  phase  optimized  EX-SOP  can  be  realized  in  an 
AND-OR-EXOR  PLA,  where  the  polarity  of  the  outputs  are 
programmable. 

4  Simplification  Techniques 

In  this  section,  we  review  AOXMIN  [6],  which  is  a  heuristic 
algorithm  to  simplify  EX-SOPs.  We  then  present  an  exten¬ 
sion  of  AOXMIN. 

4.1  An  Overview  of  AOXMIN  [6] 

Basic  steps  of  AOXMIN  are  as  follows: 

1.  Obtain  a  minimized  cover  F  for  the  given  function  / 
and  compute  a  cover  R  for  /. 

2.  Group  the  cubes  of  F  into  clusters  of  cubes.  Two  cubes 
are  in  the  same  cluster  if  they  intersect  or  they  are  con¬ 
nected  through  a  chain  of  intersecting  cubes.  (In  [6],  a 
cluster  of  cubes  are  called  an  equivalence  class.) 

3.  Randomly  partition  the  cluster  of  cubes  into  two  cov¬ 
ers,  Fa  and  Fb. 

4.  Obtain  two  EX-SOPs  by  using  AOXMIN_SPECIFY- 
_Both(F^,Fb,R)  and  AOXMIN  .SPECIFY  _Both(  FB,FA,R) 
(Fig.  2).  AOXMIN_SPEClFYJBOTH  returns  two  SOPs 
which  form  an  EX-SOP.  ESPRESSO(Fk,Dk,Rk)  in  Fig.  2 
obtains  a  minimized  cover  for  a  function,  where  Fk/ 
Dk/  and  Rk  represents  the  ON-set,  DC-set,  and  OFF- 
set,  respectively. 

5.  Iterate  steps  3  and  4  for  some  specified  number 
of  times,  and  take  the  best  EX-SOP  among  all  the 
EX-SOPs  generated  so  far. 

In  addition,  AOXMIN  simplifies  complement  of  the 
given  function  and  uses  some  output  phase  optimization 
technique  to  obtain  better  solution. 
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1  procedure  AOXMIN^PECIFY-BOTH(Fi4,FB(R)  { 

2  F„  4-  EspresscKF^Fb); 

3  R assigned  ^  F0  ®  ^Ar 

4  Fftmp  4—  Fg  U  Reassigned’ 

5  Rtemp  +-F,,U(R® 

•^assigned)/ 

6  Ff,  «-  ESPRESSOlFfempAR/emp); 

7  return  (Ffl/Ffc); 

8} 


Figure  2:  Pseudocode  AOXMIN-Specify_Both. 

4.2  An  Extension  of  AOXMIN 

The  proposed  heuristic  method  to  simplify  EX-SOPs,  which 
is  an  extension  of  AOXMIN  [6],  have  the  following  features: 


•  It  can  simplify  EX-SOPs  for  functions  with  two-  and 
four-valued  variables,  and  can  treat  functions  where 
different  variables  have  different  values.  On  the  other 
hand,  AOXMIN  simplifies  only  two-valued  functions. 

•  It  uses  heuristic  algorithms  to  partition  the  cluster  of 
cubes  for  adders.  In  this  regard,  AOXMIN  uses  only  a 
random  partitioning  method. 

•  During  iterative  improvement,  it  concurrently  mini¬ 
mizes  both  SOPs  of  the  EX-SOP  to  reduce  the  total 
number  of  products  by  increasing  shared  products  be¬ 
tween  two  SOPs.  On  the  other  hand,  AOXMIN  uses 
simultaneous  minimization  of  both  SOPs  only  once  as 
part  of  its  simplification  technique  for  multiple-output 
functions. 

•  For  multiple-output  functions,  it  performs  concurrent 
simplification  of  all  the  outputs.  However,  AOXMIN 
simplifies  each  output  separately  throughout  the  algo¬ 
rithm.  A  modified  AOXMIN  considers  simplification 
of  all  the  outputs  simultaneously  [7]. 

•  For  the  output  phase  optimization  of  EX-SOPs,  it 
uses  techniques  for  the  output  phase  optimization  of 
SOPs  [11].  AOXMIN  handles  the  output  phase  opti¬ 
mization  problem  in  a  different  way. 

•  To  find  good  solutions  quickly,  especially  for  adders,  it 
selects  from  two  different  minimizers  for  SOPs.  On  the 
other  hand,  AOXMIN  uses  only  Espresso  [1]. 

•  The  method  makes  efficient  use  of  the  given  don't  care 
conditions  during  grouping  the  cover  into  cluster  of 
cubes  and  also  during  every  minimization  of  the  SOPs 
of  the  EX-SOP.  AOXMIN  does  not  use  don't  care  con¬ 
ditions  during  these  two  operations. 

Theorem  4.1  An  arbitrary  multiple-valued  input  two-valued 
output  function  can  be  represented  by  an  EX-SOP  of  the 
form  (2.1). 

The  minimization  of  SOP  for  a  multiple-output  function 
corresponds  to  the  minimization  of  SOP  for  its  characteris¬ 
tic  function  [11].  Similarly,  we  can  prove  the  following: 


1  procedure  MODiHED^PEClFY_BOTH(F/i, FB,D,R)  { 

2 

f7 AsharpD  ® 

3 

•F BsharpD  Fg  ®  D; 

4 

Fa «-  Simplify  JJNGLE(Fi4s/wrpD/D  u  R,FtshaTpD ); 

5 

^assigned  FaO  R) 

6 

& remained  R  ®  R assigned) 

7 

Fb  ^ BsharpD  0  Rassigned) 

8 

Ra  i—  FftgharpD  O  Remained/ 

9 

Rb  ^  F AsharpD  U  Remained’ 

10 

Fdb,  <-  MAKE.D0UBLE_0UT.C0VER( F„, Fg); 

11 

Rd w  «-  Make  .Double  _Out_Co  ver(  Ra,Rb); 

12 

Ddbl  <-  Make-Double.Out.Cover(D,D); 

13 

Fex-sop  «-  Simplify JX)UBLE(F<,W/D(,w,RdW); 

14 

15} 

return  Fex-sop', 

Figure  3:  Pseudocode  Modified^pecify_Both. 


/*  F  =  ON-set,  D  =  DC-set,  R  =  OFF-set  */ 

1  procedure  Simpufy_Local(F,D,R)  { 

2  Fir-  REDUCE(F,D); 

3  F  i—  EXPAND(F,R); 

4  Fi~  Irredundant(F,D); 

5  return  F; 

6} _ 

Figure  4:  Pseudocode  Simplify_Local. 

Theorem  4.2  The  minimization  of  EX-SOP  for  a  multiple- 
output  function  corresponds  to  the  minimization  of  EX-SOP  for 
its  characteristic  function . 

Now,  the  definition  of  the  cluster  of  cubes  can  be  ex¬ 
tended  as  follows: 

Definition  4.1  Let  F  and  D  be  the  covers  for  the  ON-set  and 
DC-set ,  respectively ,  of  the  characteristic  function  for  a  multiple- 
output  function.  Then,  two  cubes  cifCj  E  F  are  in  the  same 
cluster  if 

(a)  G(i,j)  0,  or 

(b)  G(i,i  -h  1)  ^  0,G(t  +  l,t  +  2)  ^  0, . . .  ,G(j  -  1,;)  #  0, 
where  G(p,q)  denotes  (cp  fl  cq)  ®  D. 

Section  4.1  shows  that  during  every  iteration  AOXMIN 
calls  AOXMIN^peofyJSoth  twice.  We  replaced  these 
calls  by  Modified_Specify_Both(F/i, Fb,D,F)  and  Mod- 
ihed_Specify_Both(Fb, FA/D,R)  (Fig.  3).  Make_Double- 
_OUT-CovER(Fjt,G*)  in  Fig.  3  receives  n-input  m-output  cov¬ 
ers  Fk  and  Gk,  and  returns  an  n-input  2m-output  cover  such 
that  covers  corresponding  to  outputs  0,1,... ,m  -  1  and 
m,m  +  1, . . .  ,2m  -  1  represent  Fk  and  Gk ,  respectively. 

In  Fig.  3,  both  SIMPLIFY SlNGLE(Fk,Dk,Rk)  and  SlM- 
PLIFY-DOUBLE (Fk,Dk,Rk)  obtain  a  minimized  cover  for  a 
function,  where  Fk,  Dk,  and  Rk  represents  the  ON-set, 
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adr3:  Is,  22,  32,  52 

adr4:  Is,  22,  32, 52, 72,  II2 

adr5:  I5, 22, 32, 52, 72, 112, 152,  232 

adr6:  Is,  22, 32, 52, 72, 112,  152, 232, 312, 472 

adr7:  15, 2Z,  32, 52, 72, 112, 152, 232, 312, 472, 632, 952 

adr8:  15, 22, 32, 52, 72, 112, 152, 232, 312, 472,  632, 952, 1272, 1912 

adr9:  I5, 22/  32, 52/  72, 112,  152/  232/  312, 472,  632, 952, 1272/ 1912,  2552,  3832 

adrlO:  15, 22, 32, 52, 72, 112, 152, 232/  312, 472, 632, 952, 1272, 1912, 2552, 3832, 5112/  7672 

adrll:  Is,  22, 32, 52, 72, 11 2, 152, 232, 312, 472, 632, 952, 1272, 1912, 2552, 3832, 5112, 7672, 10232, 15352 

Figure  5:  Distribution  of  the  clusters  of  output  phase  optimized  SOPs  for  adders  with  two-valued  inputs. 


DC-set,  and  OFF-set,  respectively.  SIMPLIFY  .SINGLE  and 
SimplifY-Double  can  be  either  Simplify_Local  (Fig.  4)  or 
Espresso-MV  [10].  SlMPLiFY_LOCAL  uses  a  single  pass  of 
Reduce,  Expand,  and  Irredundant  operations  to  obtain 
a  simplified  SOP  [10].  It  reduces  the  number  of  cubes  by 
locally  changing  the  shape  of  the  cubes.  Espresso-MV  iter¬ 
ates  these  operations  as  long  as  the  solution  improves.  Sec¬ 
tions  5  and  6  explain  how  the  choice  of  the  two-level  mini¬ 
mizes  affect  the  quality  of  the  solution  and  execution  time. 

5  Design  of  Adders 

In  this  section,  we  propose  partitioning  methods  of  the  clus¬ 
ter  of  cubes  for  adders  with  one-  and  two-bit  decodes,  and 
discuss  about  the  choice  of  the  two-level  minimizes.  Note 
that  EX-SOPs  for  functions  with  two-  and  four-valued  in¬ 
puts  correspond  to  AND-OR-EXOR  PLAs  with  one-  and 
two-bit  decodes,  respectively  (Fig.  1). 

During  minimization  of  addes,  we  use  SIMPLIFY _Local 
for  Simplify -Single  and  Espresso-MV  for  Simplify_Double 
in  Fig.  3.  We  observe  that  if  Espresso-MV  is  used  for 
SlMPUFY-SlNGLE  then  the  resulting  awkward  shape  of 
Rassignrd  in  Fig.  3  prevent  us  from  obtaining  a  good  solu¬ 
tion  in  the  next  minimization  by  using  SIMPLIFY-DOUBLE. 

5.1  Adders  with  One-Bit  Decoders 

We  found  that  output  phase  optimized  SOP  for  n-bit  (3  < 
tt  <  11)  adder  with  two- valued  inputs  have  An  -  1  cluster  of 
cubes.  Fig.  5  shows  the  distribution  of  these  clusters,  where 
an  entry  ck  represents  k  clusters  each  having  c  cubes.  It  is 
interesting  that  the  number  of  cubes  in  the  clusters  have  a 
regular  structure.  To  partition  the  cluster  of  cubes  into  two 
covers  FA  and  FB,  we  use  the  following  method: 

1.  Sort  the  clusters  in  descending  order  of  the  number  of 
cubes  in  it. 

2.  Starting  from  the  beginning  of  the  sorted  list  of  the 
clusters,  alternatively  add  a  pair  of  clusters  to  FA  and  a 
pair  of  clusters  to  FB- 

3.  Add  the  remaining  cluster  to  Fb. 

Example  5.1  For  three-bit  adder  with  two-valued  inputs ,  the 
number  of  cubes  in  the  clusters  which  form  FA  and  FB  are  5, 5, 2, 
2, 1,  2,  and  3, 3,  2, 2,  2,  respectively. 


adr4: 

14/  22, 32 

adr5: 

I4/  22, 32, 42 

adr6: 

I4,  22/  32,  42,  52 

adr7: 

I4/  22, 32, 42, 52, 62 

adr8: 

14/  22, 32, 42/  52, 62, 72 

adr9: 

Ur  22/  32, 42, 52, 62, 7 2, 82 

adrlO 

:  14, 22, 32, 42, 52, 62, 72/  82, 92 

adrll: 

:  I4, 22, 32, 42, 52, 62, 7 2, 82,  92, 102 

Figure  6:  Distribution  of  the  clusters  of  output  phase  opti¬ 
mized  SOPs  for  adders  with  two-bit  decoders. 


The  above  partitioning  method  is  devised  by  considering 
outputs.  Adders  have  pairs  of  clusters,  where  each  pair  be¬ 
longs  to  a  particular  set  of  outputs.  Roughly,  the  strategy 
is  to  put  the  clusters  from  such  pair  into  two  different  par¬ 
titions.  A  similar  method  is  also  devised  for  adders  with 
four-valued  inputs. 

5.2  Adders  with  Two-Bit  Decoders 

We  obtained  functions  with  four-valued  inputs  from  their 
two-valued  counterparts  by  pairing  two  variables  using 
Espresso-MV  [10].  Fig.  6  shows  the  distribution  of  the  clus¬ 
ters  of  output  phase  optimized  SOPs  for  adders  with  two- 
bit  decoders,  where  an  entry  ck  represents  k  clusters  each 
having  c  cubes.  It  shows  that  the  output  phase  optimized 
SOP  for  n-bit  (4  <  n  <  11)  adder  with  two-bit  decoders  have 
In  clusters.  Note  that  the  number  of  cubes  in  the  clusters  for 
adders  with  two-bit  decoders  also  have  a  regular  structure. 
We  use  the  following  method  to  partition  the  clusters  into 
two  covers  FA  and  FB: 

1.  Sort  the  clusters  in  descending  order  of  the  number  of 
cubes  in  it. 

2.  Starting  from  the  beginning  of  the  sorted  list  of  the 
clusters,  at  first  add  a  pair  of  clusters  to  FAf  then  al¬ 
ternatively  add  a  cluster  to  FA  and  FB. 

6  Experimental  Results 

We  implemented  the  proposed  heuristic  method  to  simplify 
EX-SOPs,  which  is  an  extension  of  AOXMIN  [6],  in  C  by 
using  Espresso-MV  [10]  routines.  On  an  HP  C160  worksta¬ 
tion  with  256  megabytes  memory  resources,  we  conducted 
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Table  1 :  Experimental  result  for  adders  with  two-valued  inputs. 

~  —  ""  Random  Partition 


Proposed  Partition  20  Iterations  50  Iterations 


Data 

In 

Out 

SOP 

OPO 

SOP 

EX-SOP 

Time 

OPO 

EX-SOP 

EX-SOP 

Time 

EX-SOP 

Time 

adr3 

6 

4 

31 

25 

12 

0.02 

11 

17 

1.24 

13 

2.95 

adr4 

8 

5 

75 

61 

21 

0.07 

18 

32 

4.48 

32 

13.46 

adr5 

10 

6 

167 

137 

37 

0.35 

36 

50 

26.04 

50 

61.98 

adr6 

12 

7 

355 

293 

67 

1.45 

66 

146 

114.38 

133 

332.95 

adr7 

14 

8 

735 

609 

122 

4.56 

120 

128 

422.94 

128 

1033.24 

adr8 

16 

9 

1499 

1245 

233 

19.58 

MEM 

423 

1634.73 

380 

3972.45 

adr9 

18 

10 

3031 

2521 

454 

66.42 

MEM 

840 

6469.34 

840 

16492.60 

adrIO 

20 

11 

6099 

5077 

967 

312.17 

MEM 

2168 

28767.08 

1898 

74434.15 

adrll 

22 

12 

12239 

10193 

1993 

1596.42 

MEM 

4136 

169809.86 

3677 

425304.35 

OPO:  Output  phase  optimized.  MEM:  Espresso-MV  memory  over. 


Table  2:  Experimental  result  for  adders  with  two-bit  decoders. 

Random  Partition 


Proposed  Partition  20  Iterations  50  Iterations 


Data 

SOP 

OPO 

SOP 

EX-SOP 

Time 

OPO 

EX-SOP 

EX-SOP 

Time 

EX-SOP 

Time 

adr4 

17 

14 

13 

0.09 

12 

13 

1.64 

13 

4.10 

adr5 

26 

22 

18 

0.34 

18 

18 

5.81 

18 

15.64 

adr6 

37 

32 

25 

0.81 

25 

25 

18.39 

25 

51.90 

adr7 

50 

44 

33 

1.94 

33 

33 

58.91 

33 

136.55 

adr8 

65 

58 

42 

6.62 

42 

43 

181.41 

43 

429.76 

adr9 

82 

74 

52 

26.11 

52 

51 

655.76 

51 

1523.54 

adrIO 

101 

92 

63 

74.46 

63 

65 

2433.61 

61 

5911.23 

adrll 

122 

112 

75 

353.35 

75 

75 

7918.99 

75 

23507.19 

OPO:  Output  phase  optimized. 


experiments  by  using  adders  with  two-  and  four- valued  in¬ 
puts  and  other  benchmark  functions  with  four-valued  in¬ 
puts.  We  obtained  functions  with  four-valued  inputs  from 
their  two-valued  counterparts  by  pairing  two  variables  us¬ 
ing  Espresso-MV.  For  all  the  experiments,  we  prepared 
minimized  SOPs  and  output  phase  optimized  SOPs  by  us¬ 
ing  Espresso-MV  with  default  options.* 

Tables  1  and  2  summarize  the  experimental  data  for 
adders  with  one-  and  two-bit  decoders,  respectively.  To 
minimize  EX-SOPs  for  adders  we  used:  a)  output  phase 
optimized  SOPs  as  the  input  for  the  EX-SOP  minimizer; 

b)  two  different  techniques  to  partition  the  cluster  of 
cubes:  partitioning  method  for  adders  from  Section  5 
and  random  partitioning  method  from  AOXMIN  [6];  and 

c)  Simplify  .Local  for  Simplify_Single  and  Espresso-MV 
for  Simplify -Double  in  Fig.  3. 

Table  1  shows  that,  for  an  n-bit  adder  with  sufficiently 
large  n,  the  proposed  partitioning  method  is  about  250 
times  faster  to  produce  about  two  times  better  solution  than 
the  random  partitioning  method.  Note  that  an  SOP  and 
an  output  phase  optimized  SOP  for  «-bit  adder  with  two¬ 
valued  inputs  require  6  •  2”  —  4n  —  5  and  5  *  2”  —  4«  —  3  prod¬ 
ucts,  respectively  [11],  However,  from  Table  1,  we  have  the 
following: 

*For  all  the  tables  in  this  section,  the  columns  with  heading  'SOP', 
'OPO  SOP',  'EX-SOP',  and  'OPO  EX-SOP'  indicate  the  number  of  prod¬ 
ucts  in  the  corresponding  expression,  where  'OPO'  is  an  abbreviation  for 
'output  phase  optimized'.  Also,  the  columns  with  heading  Time'  indicate 
the  CPU  seconds  spent  by  the  extended  version  of  AOXMIN  to  simplify 
EX-SOP  and  it  do  not  indude  the  time  to  prepare  minimized  SOP  or  out¬ 
put  phase  optimized  SOP. 


Conjecture  6.1  When  n  is  sufficiently  large ,  an  output  phase 
optimized  EX-SOP  for  n-hit  adder  with  two-valued  inputs  re¬ 
quires  at  most  2”  products. 

The  above  shows  that  an  output  phase  optimized 
EX-SOP  requires  about  one  sixth  of  the  products  in  an 
SOP.  This  result  would  be  useful  to  design  adders. 

Table  2  shows  that  the  proposed  partitioning  method 
produced  good  solutions  quickly.  However,  in  most  cases, 
these  solutions  can  be  obtained  by  random  partitioning 
method  by  a  reasonable  increase  in  the  computation  time. 
The  experimental  data  also  reveals  that  the  minimization 
time  for  EX-SOPs  with  four-valued  inputs  is  much  smaller 
than  that  for  the  corresponding  EX-SOPs  with  two-valued 
inputs,  because  the  former  requires  many  fewer  products 
than  the  later.  Note  that  an  EX-SOP  for  an  n-bit  adder 
with  two-bit  decoders  requires  at  most  (n2  -I-  n  +  2)/2  prod¬ 
ucts  [13]. 

Table  3  presents  experimental  results  for  benchmark  cir¬ 
cuits  with  four-valued  inputs.  We  used  both  SOPs  and 
output  phase  optimized  SOPs  as  the  input  for  the  EX-SOP 
minimizer;  and  Espresso-MV  for  both  SlMPUFY_SlNGLE  and 
Simplify JDouble  in  Fig.  3. 

We  used  adr6  to  see  how  the  choice  of  the  two-level 
minimizers  in  Fig.  3  affect  the  quality  of  the  solution  and 
execution  time.  By  using  random  partitions  and  1000  it¬ 
erations,  we  found  that  when  Espresso-MV  is  used  for 
both  Simplify jSingle  and  Simplify_Double  the  algorithm 
requires  6253.79  seconds  and  produces  a  solution  with 
122  products;  however,  when  we  use  SIMPLIFY.LOCAL  for 
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Table  3:  Experimental  result  for  EX-SOPs  with  four-valued  inputs. 


Input  is  SOP 

Input  is  OPO  SOP 

_ 

■ 

20  Iterations 

50  Iterations 

20  Iterations 

50  Iterations 

Data 

SOP 

EX-SOP 

Time  EX-SOP 

Time 

OPO 

SOP 

EX-SOP 

Time 

EX-SOP 

Time 

5xp1 

addm4 

f51m 

life 

rd84 

rdm8 

sqr8 

46 

109 

51 

26 

54 

51 

157 

35 

97 

37 

20 

35 

37 

152 

19.31 

208.03 

17,86 

4.72 

22.81 

17.85 

274.39 

35 

95 

35 

20 

35 

35 

147 

51.15 

519.51 

47.13 

11.86 

57.72 

44.19 

674.02 

42 

101 

50 

26 

37 

50 

148 

29 

89 

40 

20 

31 

40 

139 

14.89 

168.09 

18.16 

4.74 

37.30 

18.17 

212.63 

29 

89 

37 

20 

31 

37 

139 

39.56 

404.82 

46.20 
12.43 
92.09 

60.20 
577.24 

OPO:  Output  phase  optimized. 


Simplify -Single  and  Espresso-MV  for  Simplify .Double, 
the  algorithm  produces  a  solution  with  81  products  and 
requires  5956.69  seconds.  We  found  similar  tendencies 
for  other  adders  too.  However,  it  is  our  experience  that, 
for  many  other  benchmark  functions,  Espresso-MV  for 
both  SIMPUFY-SINGLE  and  SimplifyJDouble  is  often  a  good 
choice. 

7  Conclusions  and  Comments 

EX-SOPs  are  promising  because  they  often  require  many 
fewer  products  than  SOPs.  We  demonstrated  that,  when 
n  is  sufficiently  large,  an  n-bit  adder  with  two-valued  in¬ 
puts  requires  at  most  2”  products  in  an  output  phase  opti¬ 
mized  EX-SOP,  while  an  output  phase  optimized  SOP  re¬ 
quires  5  •  2”  -  4n  -  3  products.  We  presented  partitioning 
method,  which  is  very  effective  to  optimize  EX-SOPs  for 
adders.  Our  experimental  result  shows  that  random  parti¬ 
tioning  method  is  unsuitable  to  design  adders  when  n  is 
large,  because  it  requires  excessive  amount  of  CPU  time 
to  obtain  a  moderate  design.  For  adders  with  two-bit  de¬ 
coders,  proposed  partitioning  method  is  faster  than  ran¬ 
dom  partitioning  method  in  producing  comparable  solu¬ 
tions.  We  found  that  the  choice  of  the  two-level  minimizers 
in  AOXMIN-like-algorithm  have  a  great  influence  on  the 
number  of  products  in  EX-SOPs  and  a  powerful  minimizer 
is  not  always  a  good  choice. 

We  obtained  functions  with  four-valued  inputs  from 
their  two-valued  counterparts  by  pairing  two  variables  us¬ 
ing  Espresso-MV  code  [10],  which  reduces  the  number  of 
products  in  SOPs  [11].  A  different  pairing  algorithm  target¬ 
ing  EX-SOPs  may  lead  to  better  solutions.  Currently,  we  are 
studying  minimization  of  EX-SOPs  for  adders  with  carry 
inputs. 
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Abstract 

This  paper  shows  the  effectiveness  of  an  output 
permutation  for  the  implementation  of  current-mode 
CMOS  circuits.  A  combination  of  a  simple  function  and 
an  output  permutation  can  realize  a  difficult  function  and 
cost  for  the  combination  will  be  lower  than  the  cost  for  a 
difficult  function.  The  output  permutation  can  be  realized 
by  a  universal  literal  and  we  can  calculate  the  cost.  We 
first  examine  the  all  combinations  of  universal  literals  and 
output  permutations  and  show  that  some  combinations  can 
realize  one-variable  functions  with  lower  costs .  Next ,  we 
minimize  two-variable  functions  and  compare  the  costs 
with  the  costs  obtained  by  some  output  permutations.  As 
the  result ,  we  show  that  about  70%  functions  can  reduce 
the  costs  and  their  reduction  ratio  is  about  12%  on  average. 
Keywords:  logic  synthesis,  cost  reduction,  universal 
literal,  output  permutation,  current-mode  CMOS  circuits. 

1.  Introduction 

Recently,  the  use  of  current-mode  CMOS  circuits  for 
implementation  of  multiple-valued  logic  functions  has 
been  studied  by  many  researchers  [1-5].  They  have  used 
universal  literals  to  express  logic  functions  and  design  their 
circuits  based  on  cost  tables.  A  cost  table  is  a  list  of  all 
literals  with  the  corresponding  implementation  cost. 
Butler  et  al  showed  two  basic  current-mode  CMOS 
circuits  and  calculated  the  costs  of  some  functions  [1]. 
Based  on  the  cost  table,  Lei  et  al.  calculated  the  costs  of  all 
four-valued  one-variable  functions  and  proposed  a 
minimization  method  for  two-variable  functions  in  1991 
[2].  Dueck  et  al.  proposed  a  direct  cover  method  in  1992 
[4]  and  a  minimization  method  using  the  simulated 
annealing  in  1998  [5]  based  on  the  cost  table  of  Ref.  [2]. 
Moreover,  in  1992,  Lei  et  al  proposed  the  sharing  of  basic 
components  and  showed  that  costs  for  some  one- variable 
functions  can  be  reduced  [3]. 

On  the  other  hand,  in  the  sum-of-products  expression 
using  window  literals,  Hata  et  al.  showed  the  effectiveness 
of  output  permutation  [6],  This  paper  showed  that  an 
output  permutation  could  save  about  10%  product  terms  on 
average  comparing  with  original  expression.  In  the  case  of 
logic  expression  using  universal  literals,  the  cost  depends 
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on  not  only  the  number  of  product  terms  but  also  the 
universal  literals  forming  the  expression.  We  conclude 
that  the  output  permutation  can  save  the  cost  not  only 
when  the  number  of  product  terms  is  reduced  by  the  output 
permutation  but  also  when  the  number  is  not  reduced.  We 
therefore  investigate  the  effectiveness  of  the  output 
permutation  for  the  logic  expression  using  the  universal 
literals. 

In  this  paper,  we  investigate  the  effectiveness  of  the 
output  permutations  for  the  implementation  of  current¬ 
mode  CMOS  circuits.  We  first  show  that  the  cost  of 
output  permutation  can  be  calculated  by  cost  table  of 
universal  literals  and  show  examples  of  cost  reduction 
using  it.  Next,  we  examine  all  combinations  of  universal 
literals  and  output  permutations,  and  show  that  some 
combinations  can  realize  functions  with  lower  costs. 
Finally,  we  minimize  the  two-variable  functions  and 
compare  the  costs  with  costs  obtained  by  some  output 
permutations.  As  the  result,  we  show  that  about  70% 
functions  can  reduce  the  costs  and  its  reduction  ratio  is 
about  12%  on  average. 

2.  Preliminaries 

Let  P  =  {0,  1,  2,  ...,  p  - 1}  be  a  set  of  /^-valued  truth 
values.  Let  X={x\9  jc2,  ...,  xn}  be  a  set  of  n  variables, 
where  jc,  (z  =  1,  2,  ...,  n)  takes  on  values  from  P.  An p- 
valued  w-variable  function  F(X)  is  a  mapping  F:Pn->P]. 
Each  element  in  the  domain  of  die  function  is  said  to  be  a 
minterm  of  the  function. 

A  universal  literal  is  a  one-variable  function  and  we 
denote  it  by  <a0a{...ap_x>(xj)  instead  of  original  notation 
<a0a]...ap_i>Xr  The  universal  literal  is  defined  by 
<a0al...afhl>(xi)  =  aXi. 

A  product  term 

A  =  <a\oa\\...aip_]>(x{)  <a2o<*2\—a2 

-.•<anQfln\—an 

is  defined  as  the  minimum  of  its  literals. 

The  truncated  sum  (TSUM  for  short)  of  two  product 
terms  A  and  B,  expressed  as  F  =  A  +  B,  is  defined  as 
F(a)  =  min(i4(a)+5(a),  p- 1), 

for  all  minterms.  Any  ^-valued  «-variable  function  can  be 
expressed  as  truncated  sum  of  product  terms. 
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Table  1  A  cost  table  for  all  four-valued  universal  literals 
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3.  The  output  permutation  and  its 
effectiveness 

A  circuit  for  a  function  is  implemented  by  current-mode 
CMOS  circuit.  We  design  the  circuit  by  the  logic 
expression  using  universal  literals  and  Fig.  1  shows  its 
circuit  structure.  The  universal  literals  are  composed  of 
current  source,  transistor  and  so  on,  and  their  costs  are 
calculated  based  on  the  number  of  transistors.  Lei  et  al 
tabulated  the  costs  for  the  all  four- valued  universal  literals 
in  [2],  and  the  table  was  updated  in  [3].  Table  1  shows  the 
cost  table  for  all  four-valued  universal  literals  [2-3].  In 
this  table,  first  column  indicates  the  output  values  for 
input  values  0  and  1,  and  first  row  indicates  the  output 
values  for  2  and  3.  Using  this  table,  we  can  calculate  the 
circuit  costs,  where  cost  for  MIN  function  is  calculated  as 
5  according  to  [2]. 

As  shown  in  the  table,  the  each  of  the  universal  literals 
has  a  certain  cost  and  there  is  a  tendency  that  the  cost  of 
literal  is  low  if  its  function  is  monotone  between  the  costs 
of  universal  literals  and  their  functions.  The  total  cost  of 
circuit  composed  of  such  low-cost  literals  is  small. 
However,  literals  for  a  function  is  almost  determined  by 
the  function.  Then,  we  propose  an  output  permutation  as 
a  method  to  transform  the  original  function  and  to  use 
lower-cost  literals.  The  circuits  can  be  realized  by  a 
structure  as  shown  in  the  Fig.  2.  The  structure  will  be 
effective  for  reduction  of  circuit  costs,  although  it  can't  use 
in  time-critical  applications  because  of  increase  of  the 
circuit  levels. 

The  output  permutation  has  some  cost  and  the 
combination  of  the  transformed  function  and  the 
permutation  might  be  more  costly  than  the  direct 
implementation  of  original  function.  Then,  we  must 


Fig.  1  A  conventional  circuit  structure. 
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Fig.  2  A  proposed  circuit  structure 


evaluate  their  total  costs  to  show  the  effectiveness  of  the 
output  permutations.  The  output  permutation  can  be 
realized  by  a  universal  literal,  because  it  is  a  one- variable 
function.  Then,  we  can  calculate  their  cost  using  the  cost 
table  of  universal  literals.  We  show  some  examples. 
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Example  1:  The  logic  function  as  shown  in  Fig.  3  can 
be  expressed  by 

F(X)  =  <0012>(x,)  +  <3010>(jc,)  <03 1 2>(jc2) 

+  <0300>(x,)  <3122>(x2) 

and  its  cost  is  68.  The  combination  of  a  function  as 
shown  in  Fig.  4  and  output  permutation  <0312>(Fr)  can 
realize  the  function  of  Fig.  3.  Then,  the  function  of  Fig.  3 
also  can  be  expressed  by 

F\X)  =  <0123>(x,)  +  <0123>(x2) 

F(F')  =  <0312>(F) 

and  their  total  cost  is  17  (6+1 1).  The  function  of  Fig.  4 
can  be  derived  by  using  the  inverse  function  of 
<0312>(Fr).  Figure  5  shows  the  permutation  and  its 
inverse.  (End  of  example) 

Example  2:  The  logic  function  as  shown  in  Fig.  6  can 
be  expressed  by 

F(X)  =  <3100>(X])  <1  133>(jc2) 

+  <0130>(jc,)  <0310>(x2) 

+  <0012>(x1}  <0021>(jc2) 

and  its  cost  is  73  (=9+6+14+13+4+12+5*3).  The 
combination  of  a  function  as  shown  in  Fig.  7  and  output 
permutation  <2301>(Fr)  can  realize  the  function  of  Fig.  6. 
Then,  the  function  of  Fig.  6  also  can  be  expressed  by 

F'(X)  =  <0133>(x,)  <3000>(jc2) 

+  <2103>(*i)  <3300>(jc2) 

+  <0233>(jc])  <0012>(jc2) 

F(Fr)  =  <3210>(F) 

and  their  total  cost  is  67  (=5+8+14+9+4+4+5*3+8).  (End 
of  example) 

Example  1  is  an  extreme  case.  In  this  case,  the  number 
of  product  terms  is  reduced  by  an  output  permutation  and  it 
causes  the  cost  reduction.  In  the  case  of  Example  2,  the 
number  of  product  terms  is  not  reduced,  but  the  universal 
literals  requiring  lower  costs  are  used  with  an  output 
permutation.  Thus,  the  cost  can  be  reduced  by  the  output 
permutation. 

First,  we  investigated  the  effectiveness  of  the  output 
permutations  for  one-variable  functions.  We  have 
combined  a  universal  literal  and  an  output  permutation,  and 
have  derived  a  function  obtained  by  them.  We  have 
compared  the  total  cost  of  the  universal  literal  and  the 
output  permutation  with  the  cost  for  direct  implementation 
of  the  obtained  function.  We  have  done  them  for  all 
combinations  of  universal  literals  and  output  permutations, 
and  have  listed  all  combinations  which  requiring  lower 
costs  than  direct  implementation.  Table  2  shows  the 
result.  The  saved  costs  for  most  of  the  functions  have 
been  1  as  shown  in  this  table,  but  we  consider  that  it  is 
remarkable  that  the  output  permutations  can  reduce  the 
costs  on  even  one-variable  functions. 
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Fig.  3  A  function  of  example  1. 
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Fig.  4  A  transformed  function  of  Fig.  3. 
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Fig.  5  The  output  permutation  and  its  inverse. 
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Table  2  Combinations  requiring 


combination  I  total  cost 


lower  costs  than  direct  implementation. 


function 


<0022> 


<021 1> 


<0303> 


<3020> 


<3022> 


<3030> 


<3122> 


<3130> 


<3303> 


saved  cost 


1 


2 


1 


1 


1 


1 


2 


1 


1 


Table  3  Costs  of  direct  implementation  and  those  with  output  permutation  by  our  CAD  tools 


_ average  cost _ 

number  original  function  with  permutation 


1000  89.7  81.8  8.7 


686  I  94.3  I  82.8  I  12.2 


all  functions 


cost-saved  functions 


Table  4  The  number  of  times  which  each  permutation  is  used  bv  our  CAD  tools. 


ermutation  1 

1  cost  1 

1  times 

in 

ermutation 

1  cost  1 

I  times 

ilp 

none 


<3210> 


<2310> 


<1023> 


<032 1> 


<3012> 


<01 32> 


<1230> 


<2103> 


<03 12> 


<2013> 


<1320> 


<3201> 


<1203> 


<023 1> 


<1302> 


ermutation 


<02 13> 


<2301> 


<3102> 


<3 1 20> 


<2130> 


<302 1> 


<1032> 


<203 1> 


cost  times 


13  21 


Table  5  Costs  of  direct  implementation  and  those  with  output  permutation  by  the  CAD  tools  of  Ref.  [2], 


_ average  cost _ 

number  original  function  1  with  permutation  saving  ratio 


1000  95.2  86.3  I  9.3 


722  I  99^6  r  87.4  I  12.3 


all  functions 


cost-saved  functions 


Table  6  The  number  of  times  which  each  permutation  is  used  bv  the  CAD  tools  of  Ref.  [2 


ermutation  cost  tunes 


278 


none 


<3210> 


<03 12> 


<2310> 


<1023> 


<2013> 


<1230> 


<032 1> 


rmutation 


<3012> 


<1320> 


<3201> 


<023 1> 


<0213> 


<0132> 


<230 1> 


<2103> 


cost  times 


12  I  41 


ermutation 


<1203> 

17 

<3 1 02> 

15 

<1302> 

19 

<203 1> 

20 

<2130> 

19 

<302 1> 

16 

<3 1 20> 

18 

<1032> 

20 
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The  all  functions  of  Table  2  consist  of  two  or  three 
kinds  of  logic  values  and  it  allows  the  combination  to  be 
low  cost.  However,  such  universal  literals  can't  output  all 
kinds  of  logic  values.  Most  of  two-or-more-variable  logic 
functions  have  all  logic  values  on  their  truth  tables  and 
functions  not  having  them  are  few.  Therefore,  we  use 
universal  literals  having  all  logic  values  as  an  output 
permutation  except  for  <0123>  and  investigate  the 
effectiveness  of  them  for  two-variable  functions. 

The  best  output  permutation  for  a  function  can  be 
obtained  as  follows.  First,  we  minimize  a  given  function 
and  calculate  the  cost.  Second,  for  each  of  all  output 
permutations,  the  number  of  kinds  is  23  (=  4!  -  1),  we 
derive  the  inverse  permutation  and  transform  the  given 
function  using  it.  We  minimize  the  transformed  function 
and  calculate  the  total  cost  of  the  function  and  its  output 
permutation.  Finally,  we  compare  the  costs  and  select  the 
combination  requiring  lowest  costs. 

According  to  the  above,  we  have  examined  the 
randomly  generated  1000  four- valued  two- variable 
functions  having  all  logic  values,  and  show  the  result  in 
Table  3.  The  table  shows  that  we  can  reduce  the  cost  for 
686  functions  out  of  1000  functions  and  their  saved  cost  is 
about  11.5  (12.2%)  on  average.  Moreover,  we  show  the 
number  of  times  which  each  output  permutation  is  used  in 
Table  4.  This  table  shows  that  the  cost  of  main  seven 
permutations  are  less  than  or  equal  to  12,  and  shows  that 
the  greater  cost  of  universal  literal  is,  the  fewer  times  the 
literal  is  used.  The  universal  literals  whose  cost  is  less 
than  or  equal  to  12  are  <3210>,  <2310>,  <1023>, 
<032 1>,  <3012>,  <0132>,  <1230>,  <0312>,  <0231>, 
and  total  times  which  the  nine  literals  are  used  is  409.  In 
addition,  the  314  functions  do  not  need  an  output 
permutation.  Then,  we  can  expect  to  obtain  the  best 
solutions  for  about  70%  functions  by  minimizing  a 
function  ten  times,  no  permutation  and  above  nine 
permutations.  The  minimization  of  the  above  is  done  by 
GA  based  CAD  tool  made  by  us,  while  the  similar  results 
are  obtained  when  we  minimize  them  by  Ref.  [2]  based 
CAD  tool  and  Tables  5  and  6  show  the  result.  From  the 
tables,  we  can  say  that  our  CAD  tools  derive  lower 
solution  than  the  CAD  tools  of  Ref.  [2]  and  the  cost 


reduction  is  not  due  to  low  minimization  efficiency  of  the 
CAD  tools.  Then,  output  permutation  is  effective  for  the 
cost  reduction  of  current-mode  CMOS  circuit 
implementations. 

4.  Conclusions 

In  this  paper,  we  show  the  effectiveness  of  output 
permutation  for  the  implementation  of  current-mode 
CMOS  circuits.  We  investigated  the  combinations  which 
require  the  lower  costs  than  the  direct  implementation  of  a 
given  function.  As  the  result,  we  have  shown  that  we  can 
reduce  their  cost  for  686  functions  out  of  randomly 
generated  1000  four-valued  two- variable  functions  and  the 
ratio  of  their  saved  cost  is  12.2%  on  average. 

It  remains  as  future  studies  to  clear  the  effectiveness  for 
three-or-more  variable  functions  and  investigate  a  method 
to  find  the  best  output  permutation  directly. 
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Abstract 

This  paper  will  discuss  on  a  logical  model  which  is  suitable  for 
representing  uncertain  statuses  existing  in  multiple-valued  logic 
systems  realized  by  minimum,  maximum  and  literals .  Then, 
some  of  the  mathematical  properties  of  functions  defined  by  the 
logical  model  will  be  presented. 

1  Introduction 

S.  C.  Kleene  [1]  first  defined  regularity  in  the  proposi¬ 
tional  operations  of  ternary  logic  {0, 1,  u}  as  follows.  A 
truth  table  for  a  ternary  operation  is  regular  if  it  sat¬ 
isfies  the  condition  “A  given  column  (row)  contains  1 
in  the  u  row  (column),  only  if  the  column  (row)  con¬ 
sists  entirely  of  1  Js;  and  likewise  for  0” .  Table  1  shows 
truth  tables  of  regular  ternary  operations  given  by  bi¬ 
nary  AND,  OR  and  NOT  operations,  respectively.  In 
Kleene’s  regularity,  0  (false)  and  1  (true)  are  identical 
with  the  truth  values  of  binary  logic;  while  u  is  the  third 
truth  value  whether  it  is  true  (1)  or  false  (0)  is  uncer¬ 
tain  or  undefined.  Therefore,  the  set-values  {0}  and 
{1}  can  be  assigned  to  0  and  1,  respectively.  Moreover, 
since  the  set- value  {0, 1}  consist  of  the  elements  of  the 
specific  truth  values  that  u  may  be  when  it  is  defined, 
the  set-value  {0, 1}  can  be  assigned  to  u.  This  suggests 
that  basic  operations  of  r-valued  logic  can  be  expanded 
into  r-valued  set- valued  logic  in  a  similar  way  as  Kleene 
did  when  r  —  2. 

M.  Mukaidono  have  proven  that  a  ternary  function 
/  :  {0, 1,  u}n  — y  {0, 1,  u}  can  be  represented  by  means  of 
the  regular  operations  defined  by  Table  1  and  the  con¬ 
stants  0,  1  and  u  if  and  only  if  /  is  monotonic  in  the  par¬ 
tial  order  relation  which  is  defined  by  0  -<  u,  1  -<  u 
and  a  -<  a  where  a  €  {0,1,  u}  [2].  Figure  1  shows  a 
Hasse  diagram  of  the  partial  order  relation  It  is  easy 
to  verify  that  the  partial  order  set  ({0,  l,u},  -<)  is  iso- 

Table  1:  Truth  Tables  of  Regular  Operations  NOT,  AND  and 
OR 

NOT  AND  OR 


0  1  u 

0  1  u 

0 

1 

0  0  0 

0  1  u 

1 

0 

0  1  u 

1  1  1 

u 

u 

0  u  u 

u  1  u 

u 


0  1 


Fig.  1.  Partial  Order  Set  {{0,l,u},-<) 


{0,1} 


Fig.  2.  Partial  Order  Set  <{{0},  {1},  {0, 1}},  C) 

morphic  to  the  partial  order  set  ({{0},  {1},  {0, 1}},  C) 
(See  Figures  1  and  2),  where  C  is  the  set-theoretical 
inclusion.  M.  Mukaidono  suggested  an  idea  how  func¬ 
tions  on  {0, 1,  u}  monotonic  in  can  be  expanded  into 
(2r  —  l)-valued  functions  (where  r  >  3)  [3].  Then,  a  way 
for  expanding  operations  on  {0, . . . ,  r  —  1}  into  those  on 
the  set  of  all  nonempty  subsets  of  {0, . . . ,  r  —  1}  (Vr 
say)  will  be  presented  [6].  Operations  on  Vr  given  by 
this  way  will  be  called  regular  because  it  is  an  expansion 
of  Kleene’s  regularity  into  r  >  3.  Now,  minimum,  max¬ 
imum  and  literals  play  an  important  role  in  multiple¬ 
valued  logic  because  they  jure  functionally  complete  with 
constants  on  the  set  {0, . . . ,  r  —  1}  [4].  This  paper  will 
first  introduce  regular  operations  on  VT  given  by  min¬ 
imum,  maximum  and  literals  on  {0, . . . ,  r  —  1}.  Then, 
some  of  the  mathematical  properties  of  functions  on  Vr 
consisting  of  the  regular  operations  will  be  presented. 
Kleene’s  ternary  logic  is  useful  for  representing  and  ana¬ 
lyzing  uncertain  statuses  of  binary  logic  systems  such  as 
transient  behavior  of  binary  logic  circuits  [5].  Therefore, 
our  results  are  useful  for  analyzing  uncertain  statuses  of 
multiple- valued  logic  systems  such  as  transient  behavior 
of  multiple-valued  logic  circuits  realized  by  minimum, 
maximum  and  literals. 
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2  Definitions  and  Properties  of  Func¬ 
tions  Represented  by  Formulas 

Let  r  be  an  r- valued  set  {0, . . . ,  r  —  1},  and  let  Vr  be  the 
set  of  all  nonempty  subsets  of  r,  i.e.,  Vr  =  P(f)\{0}} 
where  V(r)  is  the  power  set  of  r .  If  a  subset  of  r  consists 
of  only  one  element,  then  it  will  be  called  a  singleton. 
The  set  of  all  singletons  of  r  will  be  denoted  by  r,,  i.e., 
r$  =  {{0}, . . . ,  {r  —  1}}.  It  is  evident  that  the  set  Vr 
will  be  a  partial  order  set  when  the  set-theoretical  inclu¬ 
sion  C  employs  in  it.  Figure  3  shows  a  Hasse  diagram 
of  the  partial  order  set  ( VT ,  C)  when  r  is  equal  to  3. 


{0,1,2} 


Fig.  3.  Partial  Order  Set  C)  where  r  =  3 


Table  3:  Truth  Table  of  V 
x\y  I  {0}  {1}  {2}  {0,1}  {0,2}  {1,2}  {0,1, 2} 


{1}  {2}  {0,1}  {0,2}  {1, 2}  {0, 1,2} 

{1}  {2}  {1}  {1,2}  {1.2}  {1,2} 

{2}  {2}  {2}  {2}  {2}  {2} 

{1}  {2}  {0,1}  {0,1, 2}  {1,2}  {0,1, 2} 

{1,2}  {2}  {0,1, 2}  {0,2}  {1,2}  {0,1,2} 

{1,2}  {2}  {1,2}  {1,2}  {1,2}  {1,2} 


{0}  W 
{1}  {1} 

{2}  {2} 

{0,1}  {0,1} 

{0,2}  {0,2} 

{1,2}  {1,2}  ............. 

{0, 1, 2}  {0, 1, 2}  {1, 2}  {2}  {0, 1, 2}  {0, 1, 2}  {1, 2}  {0, 1, 2} 


Table  4:  Truth  Table  of  xa 


X 

{0}{1}  {2}  {0.1}  {0,2}  {1,2}  {0,1, 2} 

{2}  {0}  {0}  {0, 2}  {0, 2}  {0} 
{0}  {2}  {0}  {0, 2}  {0}  {0,2} 
{0}  {0}  {2}  {0}  {0,2}  {0,2} 

{0,2} 

{0,2} 

{0,2} 

Now,  let  us  expand  Kleene’s  regularity  into  operations 
on  VT  below. 

Definition  1:  Let  o  be  an  n-ary  operation  on  r.  Then, 
an  n-ary  operation  o  on  based  on  o  is  defined 
by  the  following  way:  for  any  a\  =  {a*, . . .  jCt™1}, 
On  =  {<£,.. .,<*»}  Of  VT,  = 

mi  mn 

U  •  *  *  U  {°(aiX  *  •  •  • » an")};  ®  wiU  ke  called  a  reg- 

*1—1  *  T»  —  1 

ular  operation  on  Vr . 

The  following  three  types  of  operations  are  basic  in 
multiple-valued  logic  because  r-valued  functions  con¬ 
sisting  of  these  operations  and  the  constants  0, . . . ,  r  —  1 
are  functionally  complete  on  the  set  r  [4]. 

a  •  fc=min(a,  6), 
a  +  6=max(a,  6), 

r  —  1  if  x  =  a, 

0  otherwise, 

where  a,  b  E  r .  The  unary  operations  xa  are  often  called 
literals.  Tables  2,  3  and  4  are  truth  table  of  regular 
operations  on  VT  based  on  *,  +  and  *a,  respectively, 
when  r  =  3.  Because  the  paper  will  focus  on  the  regular 
operations  of  Tables  2,  3  and  4,  they  will  be  denoted  by 
A,  V  and  respectively. 

Definition  2:  Formulas  are  defined  inductively  as  fol¬ 
lows: 

(1)  Constants  {0},  —  1}  and  literals  xf  (i  = 

1, . . . , n;  a  G  r ,)  are  formulas. 


(2)  If  g  and  h  are  formulas,  then  (g  A  h)  and  (g  V  h) 
are  also  formulas. 

(3)  It  is  a  formula  if  and  only  if  we  get  it  from  (1) 
and  (2)  in  a  finite  number  of  steps. 

For  the  sake  of  simplicity,  the  operation  A  will  be  often 
omitted  in  formulas. 

It  will  be  considered  that  every  formula  can  realize  a 
function  on  VT  when  each  variable  Xi  will  take  a  value 
of  the  set  Vr.  Furthermore,  it  is  easy  to  verify  that 
functions  on  Vr  represented  by  formulas  can  not  realize 
all  of  the  functions  on  Vr ,  i.e.,  they  are  not  functionally 
complete  on  Vr .  Thus,  one  of  the  main  subjects  of  the 
paper  is  to  appear  what  functions  on  Vr  can  be  realized 
by  formulas. 

Theorem  1:  Let  /  be  a  function  from  V?  into  Vr 
represented  by  a  formula.  Then,  for  any  element 
(ai,...,On)  6  r is  also  an  element  of 
Tt' 

Proof:  The  theorem  can  be  proven  directly  from  Defi¬ 
nitions  1  and  2.  ■ 

Theorem  2:  Let  /  be  a  function  from  V?  into  Vr 
represented  by  a  formula.  Then,  f(a C 
/(&l, . . .  ,6n)  holds  for  any  elements  (ai,...,an)  and 
(6i, . . . ,  6n)  of  V?  such  that  a*  C  6*  for  ah  i. 

Proof:  It  is  evident  that  any  constants  {0}, . . . ,  {r  —  1} 
and  literals  xf  (where  a  E  r,)  satisfy  the  theorem.  Now, 
let  us  suppose  that  functions  g  and  h  on  Vr  represented 
by  formulas  satisfy  the  theorem,  i.e.,  ^(oi,...,an)  C 
g{b i,"mM  and  Ma! »•••»<*»)  ^  A)  for  el¬ 

ements  (oi, . . . ,  On)  and  (i>i , . . . ,  6n)  of  V*  such  that 
ai  C  b{  for  all  i.  Then,  we  can  easily  verify  by  Defi- 
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nition  1  that 

9{a  1  j  •  •  •  i  ®n)  A  h(fll)  •  •  •  >  ®n) 

C  g(bu  . . . ,  bn)  A  h(6i , . . . ,  6n)  and 
g(aii...,an)Vh(ait...tan) 

Cg(bu...,bn)Vh(bu...,bn). 

This  will  complete  the  proof  of  the  theorem  because  of 
the  induction  of  the  number  of  operations.  ■ 

Theorem  3:  Let  /  be  a  function  from  V™  into  Vr  rep¬ 
resented  by  a  formula,  and  let  a\y . . . , i,  a*+ 1, . . . ,  an 
be  any  fixed  elements  of  Vr.  Then,  for  any  a  and  6  of 
Vrn\rt,  the  least  element  of  the  subset  f(ai, . . . ,  a*_i,  a, 
Ot+i,  ...,On)  of  r  is  equal  to  the  least  element  of  the 
subset  /(ai,...,a*_i,6,a*+i,...,an)>  i-e., 

min  /(ai, ...» ai_i,  a,  a,+i, . . . ,  an) 

=  min  /(ai, . . . ,  Oi_i,  6,  a*+ 1, . . . ,  an). 

Proof:  It  is  evident  that  any  constants  {0}, . . . ,  {r  - 1} 
and  literals  s®  (where  a  E  r,)  satisfy  the  theorem.  Now, 
let  s  =  {s0, • . . ,  sm}  and  t  =  {t0, . . . ,  t,}  be  any  elements 
of  VI»  such  that  So  ^  ^  8m  and  <0  <  •••  <  <!•  Then, 

by  Definition  1,  it  follows  that 


ml  ml 

3  At  =  UIK  Si-tj }  and  sVt  =  U  U^s<  + 

t-Oj-0  *=0j=0 


Therefore,  the  least  elements  of  s  At  and  s  Vt  are  deter¬ 
mined  by  the  least  elements  of  s  and  t.  Thus,  we  will 
be  able  to  prove  the  theorem  by  the  induction  of  the 
number  of  operations.  ■ 

From  Theorems  1,  2  and  3,  any  function  on  Vr  rep¬ 
resented  by  a  formula  satisfies  the  following  Condition 


A. 

Condition  A: 

1.  If  (ai,...,On)  E  r?,  then  /(ai,...,an)  E  r,. 

2.  For  any  (ox,...,an)  and  (hi,...,6n)  of  V* ,  if 
(ai,...,On)  C  (6i,...,6n),  then  /(ai,...,an)  C 

&n). 

3 .  For  any  a  and  b  of  V^r,,  the  least  ele¬ 

ment  of  the  subset  /(ai, . ..  ,a*_i,a, a*+i, ..  .,an) 
of  r  is  equal  to  the  least  element  of  the  subset 
f(alj  —  i  a*+i»  •  •  •  >  an)»  I*e*> 


min  f(a\ , . . . ,  a^_i,  a,  a*+i , . . . ,  On) 

=  min /(ax, . . . ,  a*_i,  6,  a<+i, . . . ,  an), 


where  ai, . . . , a*_i, ai+i, . . .  ,On  are  any  fixed  ele¬ 
ments  of  Vr . 

In  the  remainder  of  the  paper,  it  will  be  proven  that 
Condition  A  is  a  necessary  and  sufficient  condition  for 
a  function  on  Vr  to  be  represented  by  a  formula. 
Reference  [3]  has  proven  a  necessary  and  sufficient  con¬ 
dition  for  a  function  on  {0, 1/2, 1}  to  consist  of  the  vari¬ 
ables  sci,  •  •  • ,  xm  operations  min,  max,  and 

the  constants  0  and  1  (in  which  the  constant  1/2  is  ex¬ 
cluded).  The  condition  of  Reference  [3]  is  identical  with 
the  first  and  second  conditions  of  Condition  A  when  0, 


1  and  1/2  will  refer  to  {0},  {1}  and  {0, 1},  respectively. 
That  is,  the  third  condition  can  be  removed  when  r  —  2. 
This  is  because  there  exists  only  one  element  of 
when  r  =  2;  and  so  the  third  condition  A  is  trivial  when 
r  =  2. 

3  Properties  between  Functions  Satisfy¬ 
ing  Condition  A  and  Formulas 

All  of  the  proofs  of  Sections  3  and  4  will  be  removed 
because  of  the  limitation  of  the  paper. 

The  previous  section  have  shown  that  any  function  on 
Vr  represented  by  a  formula  satisfies  Condition  A.  From 
now  on,  it  will  be  proven  that  any  function  on  Vr  satisfy¬ 
ing  Condition  A  can  be  represented  by  a  formula,  and 
shown  a  way  how  a  formula  can  be  constructed  from 
a  given  function  on  Vr  satisfying  Condition  A.  That  is, 
Condition  A  will  be  a  necessary  and  sufficient  condition 
for  a  function  on  Vr  to  be  represented  by  a  formula. 
This  section  will  discuss  some  of  the  properties  existing 
between  functions  satisfying  Condition  A  and  formulas. 
Definition  3:  Let  /  be  a  function  from  Vf1  into  Vr, 
and  let  A{  =  (ox , . . . ,  a*-!,  a*+i, . . . ,  an)  be  an  element 
of  rn-i  Then,  we  will  define  two  types  of  functions 

a ..,«„(*)  /ai,. ...,<*»(*)  (* 

1, . . . ,  n)  on  Vr  by  the  following  formulas,  respectively: 


fa+ 


$er 


WA  V  **>(!) 


where  P^.{s)  =  {b  £  V^|  min /(ai,...,  Oi_i,fc,a,+i - , 

an)  =  s},  and  P£(s)  is  the  set  of  all  maximal  elements 
of  the  set  and 


V  V 


{*}  A 


v  (a-H 


(2) 


where  Q^.(a)  =  {b  E  Ir\r,|/(ai,...,a*_i,&,ai+i - - 

an)  =  s}  and  Q^"(s)  is  the  set  of  all  minimal  elements 

°f%i(5)* 

In  formulas  (1)  and  (2),  if  P£t(s)  and  Q^”.(a)  are  the 
emptyset,  then 


\f  xb  and  V 


0}  a  |  v  (A*{m} 

b fQ^)  'm€6  y 


are  defined  by  {0};  and  if  (a)  is  {{0, 1, . . . ,  r  —  1}}, 


then  is  defined  by  the  constant  r.  Further¬ 

more,  if  a  function  /  on  VT  satisfies  Condition  A  and 
an  element  a  of  V^\r#  belongs  to  a  set  P^a),  then  the 
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element  {0, 1, ...» r  -  1}  of  VT  has  to  also  belong  to  the 
set  P^.(s)  because  f  satisfy  the  second  and  the  third 

conditions  of  Condition  A.  Thus,  in  this  case, 
wiU  be  the  set  {{0, 1, . . . ,  r  -  1}}.  Therefore,  because 
p£(s)  =  {r}  or  P£(s)  C  r,  holds  for  each  P£(s), 
formula  (1)  is  well-defined  when  f  satisfies  Condition 
A. 

Lemma  1:  Let  /  be  a  function  from  V™  into  VT  sat¬ 
isfying  Condition  A,  and  let  oi, . . .  a»+i>  * . . ,  an 

be  elements  of  rg.  Then,  for  every  a  G  Vr,  there 
exists  a  subset  K  of  r  such  that  {/o}  C  K  C 
f(au  •  •  ■ ,  <H-U  a,  -‘.On)  (in  which  f0  is  the  least 
element  of  /(ai, . . . ,  a*_x,  a,  a*+x, . . .  ,an)),  and  the  fol¬ 
lowing  equality  holds  for  the  element  a: 

/*+  (a)  = 

r/(^(a))  if /(4(a))  Gr,  (3) 

\  K  otherwise  v 

where  Ai(a)  =  (ai, . . . ,  Of_i,  a,  Oj+x, . . . ,  an)- 
Lemma  2:  Let  /  be  a  function  from  VJ.n  into  VT  satis¬ 
fying  Condition  A,  and  let  ax,..  .  ,a*_x,Ot+i,an  be  el¬ 
ements  of  rt.  Then,  the  following  equality  holds  for 
every  a  EVr: 


(a)  = 


r  {0}  iff(Ma))er,  f4) 

\{0}  u  /(At*(a))  otherwise  v  ; 

where  A<(a)  =  (ax, . . .  ,ai_i,  a,  a*+i, . . .  ,an). 

Example  1:  Consider  the  function  /  on  V3  defined  by 
Table  5.  It  is  easy  to  verify  that  /  satisfies  Condition 
A.  Since 


P{0}(0)={°  €  V*\ ^nf(a>  {°})  =  °} 

={{0},  {2},  {0, 1},  {0, 2},  {1, 2},  {0, 1,2}}, 

i5{1o}(1)={a  G  V*\  ““/(“i  W)  =  !}  =  {(!}}> 
•p{0}(2)={°  ^  V8\ min  /(a,  {0})  =  2}  =  0, 

we  have  Pj+( 0)  =  {{0,1,2}},  P$(l)  =  {{1}}  and 
P^(2)  =  0.  Therefore, 

/{+(«)=({0}  A  {0, 1, 2})  V  ({1}  A  xW)  V  ({2}  A  {0}) 
={l}x™.  (5) 


In  a  similar  way, 

/{+(*)  =  {i}*<1}v*{2V 

)  = 

/{o}(y)  =  {®}> 

=  {!}  V  y{2}, 

fUy(y)  =  »{1}- 


(6) 


Table  5:  Example  of  Function  /  on  Vj  Satisfying  Condition  A 


g\y 


{0}  {1}  {2}  {0,1}  {0,2}  {1,2}  {0,1,2} 


{0}  I  {0}  {0}  {0}  {0}  {0}  {0,2}  {0,2} 

{1}  {1}  {1>  W  W  t1-2}  t1-2)  i1’2} 

{2}  {0}  {2}  {0}  {0,2}  {0}  {0,2}  {0,2} 
{0,1}  {0,1}  {0,1}  {0,2}  {0,1, 2}  {0,1, 2}  {0,1, 2}  {0,1, 2} 
{0,2}  {0}  {0,2}  {0}  {0,2}  {0}  {0,2}  {0,2} 
{1, 2}  {0, 1, 2}  {0, 1, 2}  {0, 2}  {0, 1, 2}  {0, 1, 2}  {0, 1, 2}  {0, 1, 2} 
{0, 1, 2}  {0, 1, 2}  {0, 1, 2}  {0, 2}  {0, 1, 2}  {0, 1, 2}  {0, 1, 2}  {0, 1, 2} 


Moreover,  since 

<?! 0}({°.  !})={«  e  F3\3,l/(«.{0})  =  {0, 1}} 

={{0,1}}, 

<?{0>({0.2})={o  G  V3\8,|/(a,{0})  =  {0,2}}  =  0, 
<?}0}({l,2})={a  €  m3,l/M0})  =  {1.2}}  =  0, 
<?{0}({0,l,2})={a  G  V3\3.|/(«,{0})  =  {0,1,2}} 
={{1,2},  {0,1, 2}}, 

we  have  Qj-}({°,  i»  =  {{0,1}},  Q}o}({°.2})  = 

Q{0}({l-2})  =  and  <?{o}({°>  1.2})  =  {{1.2}}-  There- 
fore, 


/{V}(*)={  ({0}  A  *<°>*{1})  V  ({1}  A  *<0>*{1})  }  V 
{0}  v  {0}  V  {  ({0}  A  a;{2}*{2})  V 
({1}A^*(2>)v({2}AxW)} 
={1}*{°>*11>  V  {l}*!1!*!2!  V  x^x™.  (7) 


In  a  similar  way, 

fl{-}(x)  =  {I}**0***1!  V *«V3}  V  {I}*!1)*!2! ’I 
VxWxW, 


fl;}(x)  =  xWxMvxWxW, 

floyiy)  =  y{liy{2}> 

f*iy{y)  =  {l}y{0M2}  V  V  {l}y*1}y{2} 

Vyl^yl2}, 


P) 


f{2}(y)  =  y{0V1}  V  y{1}y{2}. 


Table  6  shows  truth  tables  of  /*+  and  for  which 
*  =  1, 2  and  a  G  {{0},  {1},  {2}}. 

Lemma  3:  Let  /  be  a  function  from  V?  into  VT  satis¬ 
fying  Condition  A,  and  let  ax, ... , a*_i,  ai+x, . . . ,  an  be 
elements  of  r$.  Then,  the  following  equality  holds  for 
every  a  £Vr: 

=  HMa))> 

where  A<(a)  =  (ax, . . .  ,Oi_x, a, a*+i, . . . , On). 
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Table  6:  Truth  Table  of  /A+  and  f'a~  (*  =  1,2; 
a  €  {{0},  {1}»{2}}) 

*  |{0}{1}{2}{0,1}  {0,2}  fl,2>  {0,1,2} 

7^j{0}{l}{0}{0,l}  {0}  {0,1}  {0,1} 

A+}(*)  {0}  {i}  {2}  {0,  i}  {0,2}  {0,1, 2}  {0,1, 2} 

4+}(x)W^>{0}{0’2}  {0}  {0>2}  {°’2} 

/]"(*)  {0}  {0}{0}{0,i}  {0}  {0,1, 2}  {0,1,2} 

A“}(*)  W{0}  {0}{0,1}  {0,2}  {0,1, 2}  {0,1, 2} 

/m(*){0}{0}{°}{°,2}  {0}  {0,2}  {0,2} 

7£R{o}{o}{o}  {0}  {0}  {0}  {0} 

0>  0.2}  {1,2}  {1,2} 
/l+}(*)  {°H2H°}  {0,2}  {0}  {0,2}  {0,2} 

A~}(*)  {0}  {0}  {0}  {0}  {0}  {0,2}  {0,2} 

/b){OHOHO)  {0}  {0,1, 2}  {0,1, 2}  {0,1, 2} 
4“}(*){0}  {0}  {0}  {0, 2}  {0}  {0,2}  {0,2} 

Table  7:  Truth  Table  of  /„  =  f'a+  V  fi~  (*  =  1,2; 
«€{{0},{1},{2}}) 

£  |{0}  {1}  {2}  {0, 1}  {0,  2}  {1,2}  {0,1,2} 
/p{0}{l}{0}{0.1}  {0}  {0,1,  2}  {0,1, 2} 
/}1}W  {0}  {1}  {2}  {0, 1}  {0, 2}  {0, 1,  2}  {0, 1, 2} 

/L(*){°H2}{0}{0,2}  {0}  {0,2}  {0,2} 

/?.,(•)  {0H°}{0}  {0}  {0}  {0,2}  {0,2} 

A}(*)  {1}{1}  {2}  {1}  0,2}  {1,2}  {1,2} 
/£}(*)  {0}  {2}  {0}  {0,2}  {0}  {0,2}  {0,2} 


and  . .„(*)  =  V 

/oi1...Ioi_i,oi+1, ...,«„(*)• 

Here,  a  special  subset  /(r,n)  of  V?  will  be  defined  in 
the  following  way: 

n 

I(r,n)  =  U{(alt...,an)  €  Vrn\ a<  €  Vr\r,  and 

t  =  l 

ai , . . . ,  at*_x,  flt+l)  •  •  • » an  G  r*} 

That  is,  for  every  element  (ai, . . . ,  an)  of  /(r,n),  there 
exist  only  one  element  of  VT\rs  and  n  —  1  elements  of 

T* 

Lemma  4:  Let  /  be  a  function  from  V?  into  Vr  satis- 
fying  Condition  A.  Then,  for  every  (ai,...,an)  £  Vrn , 
there  exists  a  subset  K  of  r  such  that  {0}  C  K 
{0}  U /(ai, . . . ,  On),  and  the  following  equality  holds  for 
the  element  (ax, . . . ,  a*): 

/i(°i,  •  •  •  J  an)  = 

f/(ax,...,an)  if  /(ax,...,an)  £  r,  U/(n,r) 

[  K  otherwise. 

Example  3:  Consider  the  function  /  defined  by  Table 
5.  Then,  by  equalities  (5)  and  (6), 

f{0}(x)—f{0}(x)  v  f{o}(x) 

={l}z™  V  {l}»t°>a«  V  {l}*™*™  V 
={!}»«  V 


Example  2:  Consider  the  function  /  defined  by  Table 
5.  By  Lemma  3,  it  follows  that 

/(*,  o)=/i+(*)  V  fl~  (*),  and 

f{a,y)-fa+(y)  v/2-(y) 

hold  for  every  a  €  {{0},  {1},  {2}},  where  /i+(*), 
/2+(y)  and  fl~{y)  have  been  given  in  equa- 
tions  (5),  (6),  (7)  and  (8).  Table  7  shows  truth  tables 
of  /*+  V  for  which  i  =  1,2  and  a  £  {{0},  {1},  {2}}. 

4  Realization  of  Formulas  for  Functions 
Satisfying  Condition  A 

This  section  will  prove  that  any  function  on  VT  satisfy¬ 
ing  Condition  A  can  be  represented  by  a  formula,  and 
also  show  a  way  how  a  formula  can  be  constructed  from 
a  given  function  on  VT  satisfying  Condition  A. 
Definition  4:  Let  /  be  a  function  from  Vrn  into  Vr. 
Then,  a  function  f\  on  Vr  will  be  defined  in  the  following 
formula: 

/l(*lj  •  •  •  i  xn)  =  Vi=l  /  *(®1»  *»)» 

where 

/  =  V 

n 

A  . 


In  a  similar  way,  by  equalities  (6),  (7)  and  (8), 
/{i}(*)={1>a;{1>  v  *{2}» 

f{o}iy)=y{1}y{'}' 

/{2i}(y)={i}vy{2}, 

ffa(y)=y{1}- 

Therefore,  fi(x,  y)  of  Definition  4  is  given  by  the  fol¬ 
lowing  formula: 

/i(*,y)  =  /l(*,y)  v/2(*,y),  (9) 

where 

f1=y{0}f{ o>  (*)  v  y{l}/{i}(*)  v  y{2}/{2}(*)> 
/2=*{0}/20}(y)  v  *{1}/2i}(y)  v  *{2}/22}(y)- 


Table  8:  Truth  Table  of  f\ 


{«} 

W 

{2} 

{0,1} 

{0,2} 

{°J-4> 

{0} 

{0} 

{0} 

{0} 

{0} 

{0,2} 

{i} 

{1} 

{1} 

{2} 

{1} 

{1,2} 

{1,2} 

{1,2} 

{2} 

{0} 

{2} 

{0} 

{0,2} 

{0} 

{0,2} 

{0,2} 

{0,1} 

{0.1} 

{0,1} 

{0,2} 

{0,1} 

{0,1,2} 

{0,1,2} 

{0.1,2} 

{0,2} 

{0} 

{0,2} 

{0} 

{0,2} 

{0} 

{0,2} 

{0,2} 

{1,2} 

{0,1,2}  {0,1,2} 

{0,2}  {0,1, 2} 

{0,1,2}  {0,1,2}  {0,1,2} 

{0, 1, 2}|{0, 1,2}  {0,1,2} 

{0,2}  {0,1, 2} 

{0,1,2}  {0,1,2}  {0,1,2} 
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Table  8  shows  a  truth  table  of  /i(®,y). 

Definition  5:  Let  /  be  a  function  from  V™  into  Vr. 
Then,  a  function  /2  on  Vr  will  be  defined  in  the  following 
formula: 


Table  9:  !T($)  and  T”(s)  where  a  €  {{0,1},  {0,2},  {1,2}, 

{0, 1,  2}}:  All  of  the  elements  («,y)  of  V*  corresponding  to  T(s) 
are  elements  of  T(s);  and  all  of  the  elements  (®,y)  of 
corresponding  to  jT-(s)  are  elements  of  T(a)  and  T~(a ). 


/a(«i,  •••>*»)  = 


Wv 


V  1  \/{0  A  •  •  •  i  *»)  |  i 

•evr\T.  Ue»  J. 


where  /,(* j  A  xi>} 

(ai,...,a„)€T“(*)  \b£a  i 

A*-- A  A  f>  T(s)  (where  s  €  ^V»)  is  the  set 
tea™  ) 

of  all  elements  of  V*  such  that  f(ai, . . . ,  on)  =  s  and 
(oi,...,o„)  g  r"  U  I,  T~(s)  is  the  set  of  all  mini- 
mal  elements  of  T(s),  and  so  is  the  least  element  of 

U  /(ai, . . . ,  On)- 

Lemma  5:  Let  /  be  a  function  from  into  VT  satis¬ 
fying  Condition  A.  Then,  for  any  element  (ai,...,an) 

of  V7n, 

/2(ai,...,an)  = 

f  {«o}  if  (ai,...,  «n)  GfjUJ(n,r) 

{ /(ai,  . . . ,  a*)  otherwise, 

where  so  is  the  least  element  of  the  set 

U  /(oi, ...  ,<*«)• 

Example  4:  Consider  the  function  /  defined  by  Ta¬ 
ble  5.  Since  each  T“(s)  (s  €  {{0, 1},  {0, 2},  {1, 2}, 
{0, 1,  2}}),  is  given  by  Table  9,  each  f$  of  Definition 
5  is  expressed  in  the  following  formulas: 

/{0,1}=a!{0}x{1}y{0}  V  *{0}*{1>y{1}, 

/{o'2}=*{0}y{1}y{2}  V  »«y«Vl}  V  *<  W}  V 
xwywym  v  xwxwyw  v  xwxmyw, 

/{0,i,2>=*{l}*{J}y{0}v*<1>^2}y{1>. 

Therefore,  /2(:r,y)  of  Definition  5  is  given  by 

/2(*.y)={0}  v  ({0}/{o,i}  v  {1}/{0,1>)  V 

({0}/{o,2}  v  {2}/{0,2})  v  ({1}/{i,2}  v  {2}/{1,2})  V 

({°}/{0,X,2}  V  {1}/{0,1,2}  V  {2}/{0,l,2}) 

={1}/{0,1}  V  A 0,2}  V  {1}/{1,2}  V  /{i,2}  V 
{1}/{0,1,2>  V  /{0,1,2}  (1®) 

Table  10  shows  a  truth  table  of  /2(®,2/)* 


x\y 

{0,1} 

{0,2} 

{1,2}  {0,1,2} 

{0} 

{1} 

{2} 

r-({0,2}) 

T~  ({1,2» 

r-({0,2})T({0,2}) 

r-({l,2})T({l,2}) 

T-({0,2})r({0,2}) 

Ay 

1  {0} 

{1} 

{2} 

w 

{0,2} 

{1,2} 

{0,1,2} 

r-({0,l»  T-({0,1»  T-({0,2}) 

r-({0,2}) 

T~  ({0, 1, 2})  T"  ({0, 1, 2})  r-({0, 2}) 
T({0,1,2})  T({0, 1, 2})  T({0,2}) 

Table  10:  Truth  Table  of  /2(*,y) 


Ay 


{0}  {1}  {2}  {0,1}  {0,2}  {1,2}  {0,1,2} 


o>{o>  (oT  {0} JGf  {oT 
{0}{0}  {0}  {0}  {0}  {0} 

{o}{o>  {0}  {o}  {0}  {0} 

{0}  {0}  {0, 1, 2}  {0, 1, 2}  {0, 1,2}  {0, 1, 2} 
{0}  {0}  {0,2}  {0}  {0,2}  {0,2} 

{0}  {0}  {0, 1 , 2}  {0, 1, 2}  {0, 1 , 2}  {0, 1, 2} 

{0}  {0}  {0, 1, 2}  {0, 1, 2}  {0, 1, 2}  {0, 1, 2} 


{0} 
{1} 
{2} 
{0,1} 
{0,2} 
{1,2} 
{0,1, 2}| 


To 

{0} 

{0} 

{0} 

{0} 

{0} 

{0} 


Theorem  4:  Let  /  be  a  function  from  VJ1  into  VT  sat¬ 
isfying  Condition  A.  Then,  for  every  (oi, . . . ,  an)  of  V^n, 

/(oi,...,°n)  =  /i(oi,...,on)  V/2(oi,...,an), 

where  f\  and  /2  are  formulas  consisting  of  /  by  Defini¬ 
tions  4  and  5. 

By  Theorem  4,  it  follows  that  the  function  /  of  Table 
5,  which  satisfies  Condition  A,  is  represented  by  the 
formulas  fi{x}y)  and  /2 (x}y)  of  (9)  and  (10). 

5  Conclusions 

This  paper  have  discussed  an  extension  of  Kleene’s  reg¬ 
ularity  into  Vr .  Then,  a  necessary  and  sufficient  condi¬ 
tion  for  a  function  on  Vr  to  be  realized  by  A,  V,  x^A 
(where  a  E  r)  and  the  constants  {0}, —  1}  has 
been  proven. 
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A  Super  Switch  Algebra  for  Quantum  Device  based  Systems 

Gerhard  W.  Dueck*Mou  Hut  and  Blair  Fraser* 


Abstract 


2  Review  of  the  Super  Switch  Model 


In  this  paper  a  multiple-valued  algebra,  super  switch  al¬ 
gebra,  is  formalized.  The  algebra  is  well  suited  for  the  de¬ 
sign  of  quantum  device  based  multiple-valued  systems .  A 
minimization  algorithm  based  on  this  algebra  is  proposed. 
This  new  algebraic  algorithm  has  some  advantages  over  the 
existing  map-based  procedure  and  subfunction-based  algo¬ 
rithm.  Benchmark  functions  are  minimized  and  the  results 
are  discussed.  Further  improvements  to  the  algorithm  are 
suggested. 


1  Super  Switch  Algebra 


With  rapid  increase  in  integrated  circuit  density,  which 
will  likely  reach  its  limits  in  the  near  further,  the  search 
for  alternative  technologies  is  on.  Among  many  emerg¬ 
ing  technologies,  multiple-valued  systems  based  on  quan¬ 
tum  devices  are  most  promising  [2,  8,  1 1,  13]. 

The  objective  of  this  paper  is  to  propose  an  algebraic 
structure  that  can  be  used  as  a  foundation  for  designing  the 
next  generation  VLSI  circuits.  To  achieve  this  goal  the  su¬ 
per  pass  gate  model  (renamed  as  super  switch  model  in  this 
paper)  proposed  by  Deng  et  al.  [3]  is  reviewed  first.  Then 
based  on  the  switch-level  algebra  proposed  by  Hu  [6],  the 
super  switch  algebra  is  constructed.  Properties  of  super 
switch  algebra  are  studied  in  detail.  Finally  an  algorithm 
based  on  super  switch  algebra  for  design  of  quantum  device 
based  multiple- valued  digital  systems  is  proposed.  The  al¬ 
gorithm  has  been  implemented  in  C++.  Benchmark  func¬ 
tions  are  minimized  to  show  the  viability  of  the  program. 
Careful  analysis  of  some  functions,  allows  us  to  propose 
improvements  to  the  algorithm. 


*  Department  of  Mathematics,  Statistics,  and  Computer  Science,  St. 
Francis  Xavier  University,  Antigonish,  N.S.,  B2G  2W5,  CANADA, 
gdueck@stfx.ca.  Research  is  partially  supported  by  NSERC. 

t  Aepos Technologies,  Adga Group, Ottawa,  Ont.  KIP 5G3,  CANADA. 
Research  was  initiated  while  visiting  St.  Francis  Xavier  University  as 
James  Chair  Professor. 


After  surveying  many  papers  on  quantum  devices  [2,  3, 
6,  8,  9,  10,  11,  12,  13]  we  found  that  the  super  pass  gate 
model  [3]  is  one  of  the  most  promising  models  in  this  cat¬ 
egory.  In  this  paper,  we  adopt  super  pass  gate  model  and 
rename  it  as  super  switch,  which  will  be  used  as  a  sug¬ 
gested  building  block  of  next  generation  VLSI  multiple¬ 
valued  systems.  The  following  is  a  brief  review  of  the  super 
switch  model. 

A  super  switch  is  a  device  with  a  data  terminal  D ,  a  con¬ 
trol  terminal  C ,  an  output  terminal  O,  and  a  characteristic 
set  S  as  illustrated  in  Fig.  1(a).  The  function  of  a  super 
switch  is  depicted  in  Fig.  1(b).  When  the  control  signal  C 
takes  value  from  the  characteristic  set  S ,  the  switch  is  on 
and  data  D  is  passing  to  output  O.  Otherwise,  the  switch  is 
off  and  data  D  cannot  pass  to  output  O. 


SwitchState 


D- 


X 


o 


On 

Off 


s  =  {0,2} 
c 


(a) 


0  12  3 

(b) 


Figure  1 .  Super  switch  model. 


Since  the  characteristic  set  S  of  a  super  switch  is  pro¬ 
grammable,  there  will  be  r2  different  super  switches  in  an 
r-valued  system  including  two  trivial  switches,  always  on 
and  always  off.  The  possible  implementation  of  a  super 
switch  using  quantum  device  technology  was  proposed  in 

[3]. 


3  Formalization  of  Super  Switch  Algebra 

Deng  et  al.  proposed  two  minimization  algorithms  for 
systems  using  super  switches  (superpass  gates)  in  [4]  and 
[3].  But  the  mathematical  foundation  of  super  switch  sys¬ 
tems  is  still  incomplete.  By  carefully  examining  the  concept 
of  super  switch,  we  found  that  the  switch-level  algebra  pro¬ 
posed  by  Hu  [6]  can  be  used  as  a  basis  for  developing  an 
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algebraic  structure  that  will  include  the  super  switch  model. 
Actually  if  we  extend  the  logic  value  set  used  in  switch- 
level  algebra  and  replace  positive  and  negative  switches  by 
super  switches,  a  new  algebraic  structure  can  be  formalized. 
We  will  call  this  new  algebraic  structure  super  switch  alge¬ 
bra  (SS  algebra  for  short)  which  serves  as  a  mathematical 
model  for  analyzing,  designing,  verifying,  and  testing  quan¬ 
tum  device  based  VLSI  systems.  This  new  algebra  will  be 
compatible  with  definitions  proposed  by  Deng  et  al.  but  in 
a  mathematically  complete  form. 

3.1  Logic  Value  Sets  of  SS  Algebra 

We  define  a  functional  logic  value  set  R  = 

{0, 1, . . .  ,  r  - 1},  where  0, 1, . . .  ,  r  - 1  are  the  logic  values 
used  in  a  sytem  with  radix  r.  Inputs  and  outputs  of  systems 
should  take  values  in  R.  When  we  functionally  specify  a 
system,  this  set  will  be  used. 

The  output  O  of  a  super  switch  may  be  in  a  high 
impedance  state  when  the  control  varible  is  not  in  the  set 
5.  We  must  be  able  to  incorporate  this  into  the  alge¬ 
bra.  Hence,  we  define  an  intermediate  logic  value  set 
/  =  R  U  { z }  =  {z,  0, 1, . . .  ,  r  —  1}  where  z  represents 
the  high  impedance  state.  Signals  inside  a  system  should 
take  values  in  /.  This  set  will  be  used  during  the  design 
phase  of  a  system. 

The  total  logic  value  set  T  =  /  U  {«}  = 

{z,  0, 1, . . . ,  r—  1,  u},  where  u  represents  an  undertermined 
logic  value  resulting  from  conflicting  logic  values.  This  set 
is  used  during  design  verification.  In  a  properly  designed 
system,  u  should  never  appear. 

In  the  total  logic  value  set  T,  we  define  a  “logic  strength 
greater  than  and  equal  to”  relation.  For  every  value  in  T, 
logic  strength  is  assigned  as  follows. 

•  u’s  logic  strength  is  the  greatest. 

•  z’s  logic  strength  is  the  weakest. 

•  0,1,...  ,  r  —  1  cannot  compare  with  each  other. 

“Logic  strength  greater  than  or  equal  to”  relation  (denoted 
as  >  relation)  is  a  partial  order  in  T.  Thus,  <  T,  >>  is  a 
lattice  as  depicted  in  Figure  2. 


u 


Figure  2.  The  lattice  for  the  SS  algebra. 


Two  operators,  connection  operator  and  super  switch  op¬ 
erator,  will  be  defined  for  the  super  switch  algebra.  For  the 
following  definitions,  x  and  y  are  constants  or  functions  in 
T.  The  connection  operator  #  is  defined  as: 

x#y  =  LU  B(x,y)  (1) 

where  LU B(x ,  y)  is  the  least  upper  bound  of  x  and  y.  In  set 
/,  the  #  operator  is  the  same  as  the  +  operator  proposed  by 
Deng  et  al.  in  [3].  However,  the  #  operator  has  a  significant 
advantage  over  the  +  operator,  because  it  allows  for  the  con¬ 
nection  with  all  values  in  T.  The  +  operator  disallowed  the 
connection  of  two  logic  values  for  which  the  >  relation  was 
not  defined.  By  using  the  #  operator,  such  a  connection  will 
yield  a  u  indicating  that  this  is  not  a  valid  output  of  the  sys¬ 
tem,  and  thus  it  can  be  used  during  the  design  verification 
phase.  The  connection  operator  is  a  mathematical  model  of 
a  connecting  line  in  a  digital  circuit. 

The  super  switch  operator  $5  is  defined  as 

(  y  if  x  E  S 

?/  $5  x  =  <  z  if  x  $  S  and  x  /  u  and  y  ^  u  (2) 
[u  \fx  =  uory  =  u 

where  y  is  the  pass  variable;  x  is  the  control  variable;  x,  y  6 
T;  S  is  the  characteristic  set  of  the  SS  operator;  SCR. 

It  is  easy  to  see  that  the  super  switch  operator  serves 
as  a  mathematical  model  for  the  super  switch  described  in 
Section  2.  For  the  super  switch  shown  in  Fig.  1,  we  have 
O  =  D  $5  C.  Deng  et  al.  [3,  4]  define  a  universal  lit¬ 
eral  <  y  >  xs  whose  function  is  similar  to  the  function  of 
our  super  switch  operator  (i.e.  y  $s  x).  We  chose  the  new 
notation,  because  the  super  switch  is  a  binary  operation  be¬ 
tween  two  arbitrary  functions.  The  former  notation  gives 
the  impression  that  the  control  variable  is  limited  to  input 
variables  only. 

4  Properties  of  the  Super  Switch  Algebra 

In  this  section  we  will  study  the  properties  of  super 
switch  algebra  in  detail.  All  the  properties  will  presented  in 
the  form  of  laws.  In  our  discussion  we  adopt  the  following 
convention:  in  an  expression,  the  priority  of  $s  operation 
is  higher  than  the  priority  of  #  operation.  Unless  otherwise 
specified,  A,  B ,  C,  x,  xi,  x^  are  functions  in  T;  a  is  a  value 
in  R;  5,  51,  52  are  subsets  of  R\  and  $  is  an  empty  set. 

Law  1.  Commutative  law  of  #  operation 

A#B  =  B#A 

Law  2.  Associative  law  of  #  operation 

A#(£#C)  =  (A#£)#C 
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Law  3.  Idempotent  law  of  #  operation 

A#A  =  A 

Law  4.  Unit  element  and  zero  element  of  #  operation 

0  is  the  unit  element  of  #  operation: 

z#A  =  A#z  =  A 
u  is  the  zero  element  of  #  operation: 

u#A  =  A#u  =  u 

Law  5.  Law  of  changing  the  order  of  two  $5  operations 

A$slzi  $S2  x2  =  A$s 2  x2  $51  X\ 

Law  6.  Associative  law  of  S5  operation 

(A  $51  x\)  $S2  x2  =  A $51  (zi$S2*2) 

Law  7.  Law  of  repeating  a  operation 

A$sx$sx  =  A$sx 

Law  8.  Zero  elements  of  operation 

(1)  u  is  the  zero  element  of  $s  operation  in  T: 

A$su  =  u$sA  =  u 

(2)  2  is  the  zero  element  of  $s  operation  in  7: 

Law  9.  Distributive  law  of  Ss  operation  to  #  operation 

(A#B)  $sx  =  (A$s  x)#(B$s  x) 

Law  10.  Absorption  law 

If  x  G  7,  we  have 

A#(A  $5  x)  =  A 

Law  11.  The  first  law  of  merging  two  $5  operations 

A$S1  x#A$S2x  =  A$SWS2x 

Law  12.  The  second  law  of  merging  two  $s  operations 

A$slx$S2x  =  A$slnS2x 

Law  13.  Law  of  deleting  a  $R  operation 

If  X  G  Ry 

A$rx  =  A 


Law  14.  Substitution  law  of  $a  operation 

a  $a  x  =  x  $a  x 

Law  15.  Law  of  deleting  an  SS  term 

If  Ay  x  G  /, 

A  $*  *  =  z 

All  the  above  laws  can  be  proven  by  using  definitions  of 
#  and  $5  operations,  and/or  by  listing  all  possible  combina¬ 
tions.  A  sample  proof  for  Law  6  can  be  found  in  [7]. 

5  Use  of  Super  Switch  Algebra  to  Express  Su¬ 
per  Switch  Networks 

A  super  switch  network  is  an  extension  of  a  pass  transis¬ 
tor  network.  There  are  basically  two  types  of  super  switch 
networks,  i.e.,  series-parallel  network  (two  levels)  and  tree¬ 
like  network.  We  will  only  consider  series-parallel  network 
in  this  paper. 

The  following  are  some  definitions  of  terms  to  be  used 
to  express  a  series-parallel  super  switch  network.  Some  of 
them  are  similar  to  the  definitions  in  [4].  An  SS-term  (also 
referred  to  as  a  product  term)  consists  of  a  data  function  and 
a  series  of  control  variables  connected  with  super  switch 
operators  (SS  operators  for  short).  We  write  the  SS-term  A 
as  follows 

A  =  ya$Sal  x !  $5a2  x3...  $5an  xn 

A  standard  SS-term  is  one  in  which  every  variable  ap¬ 
pears  exactly  once.  According  to  Law  13,  any  SS-term  that 
does  not  include  the  variable  Xj  can  be  transformed  into  a 
standard  SS-term  by  adding  %R  x j.  A  fundamental  SS- 
term  is  a  standard  SS-term  with  the  following  characteris¬ 
tics:  (1)  its  data  function  is  a  constant;  (2)  the  characteristic 
set  of  each  SS  operator  contains  only  one  logic  value.  It 
is  easy  to  see  that  one  fundamental  SS-term  corresponds 
to  one  row  of  a  truth  table  of  a  logic  function.  We  some¬ 
times  refer  to  a  fundamental  term  as  a  minterm.  It  has  been 
proven  [4]  that  any  n-variable  r-valued  function  F(X)  can 
be  expressed  in  the  form  of  a  parallel  connection  of  funda¬ 
mental  SS-terms  (also  called  sum  of  fundamental  SS-terms) 
as 

Y.  F(ai,a2, ...  ,an)  $al  x\  $a2  x2 . . .  $on  xn 

ai,a2,...  ,a»*=0 

where  the  symbol  E  represents  the  connection  operators. 

A  sum  of  products  is  a  series  of  one  or  more  SS-terms 
connected  with  the  connection  operator  #.  By  combin¬ 
ing  some  fundamental  SS-terms,  any  function  can  be  rep¬ 
resented  by  a  sum  of  products.  An  SS-term  A  is  said  to 
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be  an  implicant  of  the  function  f(X),  if  and  only  if  for 
some  assignments  of  x,A{x)  ^  z  and  for  every  assign¬ 
ment  of  x  either  A(x)  =  f(x)  or  A(x)  =  z.  An  im¬ 
plicant  A  is  said  to  be  prime  if  there  is  no  other  impli¬ 
cant  B  such  that  Vz,  A(x)  ^  z  =>  A(z)  =  B(x)  and 
3x ,  A{x)  =  z  where  B(x)  E  -R.  It  is  easy  to  see,  that  any 
function  can  be  expressed  by  a  sum  of  products,  where  all 
SS-terms  are  prime  implicants.  An  SS-term  A  subsumes 
another  SS-term  B  if  and  only  if  A  is  an  implicant  of  the 
function  F(X)  =  B. 

The  consensus  operations  has  been  successfully  used  to 
find  all  prime  implicants  of  a  MVL  function  [1,  5].  The 
consensus  operation  of  two  SS-terms  is  defined  to  produce 
implicants  that  can  be  obtained  from  the  two  given  impli¬ 
cants.  An  implicant  generated  by  the  consensus  operation 
from  given  terms  A  and  B ,  covers  minterms  from  both  A 
and  S.  Since  the  pass  signal  of  an  SS-term  could  be  either 
a  constant  or  a  variable,  the  definition  of  consensus  is  more 
complex  than  the  conventional  one  (such  as  the  one  de¬ 
scribed  by  Allen  and  Givone  [1]).  The  consensus  in  the  ith 
dimension  between  the  following  two  standard  SS-terms, 
where  1  <  i  <  n 


A  =  ya  $Sal  xi  $Sa2  x2  . . 

, .  $San  x„ 

B  =  yb  $s61  X!  $S62  x2  . . 

.  $S6n  xn 

is  denoted  as  A  **  B  and  can  be  defined  as  follows: 
1.  If  both  ya  and  t/&  are  variables  and  ya  =  yb. 


Figure  4.  Illustration  of  case  2. 


3.  If  both  ya  and  yb  are  constants  and  ya  ^  yb  and  ya  E 
Sai  and  yb  E  Sbi, 

A**  B  =  Xi  $SalnSbl  Xl...  xi . . .  $SannSbn  Xn 

Examples.  [O$0,1’2,3xi  $°  x2]  *2  [2$1,2xi $2,3a?2]  = 
x2  $1,2  xi  $0}2  X2  (see  Figure  5). 


A**  B  =  ya  $5alnSM  Xl...  $SaiuSbi  Xi . . .  $SannSbn  x. 

Example  1.  x\  S0’1’2  x\  $°  x2  *2  x\  $1>2>3  x\  $2  x2  = 

$1,2  ®i  $0,2  (see  Figure  3). 
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Figure  3.  Illustration  of  case  1. 


Figure  5.  Illustration  of  case  3. 

4.  If  both  ya  and  yb  are  variables  and  ya  ^  yb. 

Part  1.  Let  ya  =  xj ,  yb  —  Xk  and  C  —  Saj  O  Sbk . 
For  each  c  E  C  let 

A'  =  c$Sal  xx...  $c  xj...  %San  xn 

B'  =  C  $S61  Xi  .  .  .  $C  Xk  . . .  $Sbn  Xn 

The  consensus  of  A!  and  B'  can  be  treated  as  case  2. 

Part  2.  Let  ya  =  Xj,  yb  =  Xk  and  Ca  =  Sai  n  Saj  n 
Sbj  Cb  =  Sbi  n  Sbk  n  Sak.  For  each  c0  €  Ca  let 

A'  =  Xi  $Sfll  xi...  $c"  $c“  xj...  $San  x„ 


2.  If  both  ya  and  t/6  are  constants  and  ya  =  Vb, 

A**  B  =  ya  $SalnSbl  xi...  $5aiu56,‘  Xi . . .  $SannSbn  x, 

Example  2.  1  $0,1,2  x\  $°  x2  *2  1  $1>2>3  x\  $2  x2  = 

1  $1,2  xi  $0,2  x2  (see  Figure  4). 


For  each  Cb  E  Cb  let 

B'  =  Xi  $Sbl  X!  . . .  $c‘  Xi...  $c‘  Xk...  $56"  xn 

Find  the  consensus  of  each  pair  A',  B'  according  to 
case  1. 
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Figure  6.  illustration  of  case  4. 


Figure  8.  Illustration  of  case  6. 


Example  4.  xi  $0,1’2’3  xi  $0,1  x2  * 2 

ar2$0-1'2>3x1$2>3x2  =  2$2x1$°’1,2x2  #3$3xi$°’1>3x2 
#  x2  $°  Xi  $°'2’3  x2  #  x2  S1  Xi  $1,2,3  x2  (see  Figure 
6). 

5.  If  ya  is  a  constant  and  yb  is  a  variable  (swap  A  and  B 
if  necessary)  and  ya  G  Sal  and  yb  —  Xi, 

A*‘B  =  Xi  $SalnSM  Xl . . .  $y°uSbi  Xi...  $SannS<m  Xn 

Example  5.  0  $0,3  xi  $0,1  x2  *2  x2  $0,1  xi  $2’3  x2  = 
x2  $°  xi  $°-2'3  x2  (see  Figure  7). 
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Figure  7.  Illustration  of  case  5. 


V*1 

x2\  0  12  3 


€ 

_1_ 

3> 

2 

2 

2 

~0~ 

T 

Figure  9.  Illustration  of  case  7. 

Example  7.  2$2,3xi  $0,1  x2  t1  xi  $0,1  xi  $0,2x2  = 
xj  $0,1’2  xi  $°  x2  (see  Figure  9). 

8.  If  ya  is  a  constant  and  yb  is  a  variable  (swap  A  and  B 
if  necessary)  and  ya  G  Sbj  and  yb  —  Xj  (j  ^  i), 

A  **'  B  =ya  $SalnSf>1  ^  $5aiu5fri  ^  _ 
$Sajnya  x.  §SannSbn  Xn 

Example  8.  2$2,3xi  $0,1,2x2  **  x2  $0,1  xi  $0,2x2  = 
x2  S0’1’2’3  xi  $2  x2  (see  Figure  10). 


6.  If  ya  is  a  constant  and  yb  is  a  variable  (swap  A  and  B 
if  necessary)  and  ya  G  Sai  and  yb  =  Xj(j  ^  i)  and  3  k 
bk  G  (Sbj  n  Sbi ) , 

A  **'  B  =xi  $SalnS61  xi . . .  $y°vbk  Xi... 

§SajC\bk  x.  $SanC\Sbn  ^ 

If  multiple  k  exist,  there  will  be  multiple  results. 

Example  6.  2  $2,3  xi  $0,1  x2  +1  x2  $0,1  xi  $0,2  x2  = 
xi  $0,2  xi  $°  x2  (see  Figure  8). 

7.  If  ya  is  a  constant  and  yb  is  a  variable  (swap  A  and  B 
if  necessary)  and  ya  E  Sbi  and  yb  = 

A*'  B  =  J/a  $5alnSM  Xl  .  .  .  $SaiU!/“  Xi  .  .  .  §SannSbn  X, 
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Figure  10.  Illustration  of  case  8. 

9.  Otherwise, 


A  *J  B  =  z. 
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Note  that  the  above  definition  of  consensus  is  based  on 
Law  11,  Law  14,  and  Law  15.  The  consensus  operation  is 
defined  with  respect  to  two  standard  SS-terms.  Terms  in 
non-standard  SS-forms  must  be  transformed  into  standard 
forms  before  a  consensus  operation  is  applied. 

In  some  cases  the  consensus  operation  will  produce  more 
than  one  term.  For  example,  if  the  two  SS-terms  meet  the 
criteria  given  in  case  6,  more  than  one  k  may  exist  and  there¬ 
fore  several  consensus  terms  may  be  generated.  Further¬ 
more,  the  two  SS-terms  may  meet  the  requirements  of  more 
than  one  of  the  cases  described  above.  For  an  illustration 
see  the  example  below. 

Example  9.  Given  two  product  terms  A  =  0$°£i$o’1,2’322 
and  B  =  x 2  $1,2  x\  S0’1,2’3  #2  we  have  A  +1  B  = 
x\  $0,1  x\  S1  x2#  xi  $0,2  x\  $2  X2 #  0  $0’1,2  x\  $°  X2  (see 
Figure  1 1).  The  first  two  products  are  obtained  from  case  6 
and  the  last  one  falls  into  case  8. 
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Figure  11.  Map  for  a  two-variable  function 


Lemma  1.  Let  A  and  B  be  two  SS-terms  at  distance  1  and 
let  i  be  the  dimension  in  which  the  corresponding  Si  do  not 
intersect.  Then  the  only  non-empty  consensus  exists  in  the 
ith  dimension. 

Lemma  2.  Let  A  and  B  be  two  SS-terms  at  distance 
greater  than  1.  Then  the  consensus  between  A  and  B  is 
empty  in  all  dimensions. 

Theorem  1.  Let  A  and  B  be  implicants  of  the  function 
f(X)  for  which  consensus  in  the  ith  dimension  exists.  Then 
C  —  A  *l  B  is  also  an  implicant  of  f(X). 

Proof.  For  each  of  the  9  cases  of  consensus  it  can  be  shown 
that  for  Va:  such  C(x)  /  u,  C(x)  =  A(x)#B(x)  =  f(x). 

□ 


6  Minimization 

In  this  paper  we  only  consider  two-level  minimization  in 
which  each  term  has  the  same  cost.  In  other  words  we  want 
to  a  “sum-of-products”  with  the  minimum  (or  near  mini¬ 
mum)  number  of  terms.  We  implemented  the  classical  two 
step  minimization  algorithm.  First,  all  prime  implicants  are 
generated.  Second,  a  minimal  cover  is  found  among  the 
prime  implicants.  The  algorithm  for  prime  implicant  gen¬ 
eration  follows  directly  from  Theorem  2.  Given  a  list  of 
implicants,  that  cover  the  function,  find  the  consensus  terms 
for  each  adjacent  pair  (according  to  Lemma  2,  we  only  need 
to  consider  terms  at  distance  less  than  2).  Add  it  to  the  list  if 
it  is  not  subsumed  by  any  of  the  terms  in  the  list.  Remove  all 
subsumed  terms.  Repeat  the  procedure  until  no  new  terms 
are  added.  Finally,  find  an  irredunant  cover  from  the  list  of 
prime  implicants. 

Some  results  of  our  implementation  in  C++  are  shown 
in  Table  1.  The  cpu  time  was  measured  on  a  Pentium  PC 
running  a  166  MHz.  All  but  two  of  the  function  in  Table  1 
are  well  known  binary  benchmarks.  We  converted  each  one 
to  a  4-valued  function  by  combining  pairs  of  input  variables 
and  combining  the  first  two  outputs,  “deng”  is  taken  from 
[4]  and  “9varl”  was  constructed  to  show  that  the  algorithm 
can  easily  minimize  function  with  9  input  variables. 

In  the  examples  used  to  test  the  algorithm,  the  prime  im¬ 
plicant  generation  required  significantly  more  computation 
time  than  the  finding  of  an  irrdundant  cover.  It  comes  as  no 
surprise  that  symmetric  functions,  such  as  9sym ,  are  partic¬ 
ularly  difficult  to  minimize.  This  is  due  to  the  fact,  that  they 
often  have  a  large  number  of  prime  implicants  that  can  be 
generated  in  many  different  ways.  In  addition,  an  irredun- 
dant  cover  can  be  obtained  in  many  ways. 

The  effort  required  to  generate  all  prime  implicants  de¬ 
pends  heavily  on  the  initial  input.  As  an  illustration,  we 
minimized  the  function  9sym  using  different  sets  of  terms  as 
input  to  our  program.  The  results  are  summarized  in  Table 
2.  9sym  has  93  prime  implicants.  As  the  terms  in  the  input 
are  closer  to  prime  implicants,  the  time  to  generate  all  prime 
implicants  decreases  dramatically.  We  are  currently  modi¬ 
fying  our  algorithm  to  preprocess  the  list  of  input  terms  to 
reduce  the  number  of  terms.  Heuristics  can  be  used  for  the 
preprocessing,  since  the  iterative  consensus  will  guarantee 
a  list  of  prime  implicants.  Preliminary  results  are  encourag¬ 
ing. 


Theorem  2.  Let  P  be  a  sum  of  products  representation  of 
the  function  f(X).  Repeatedly  remove  from  P  all  SS-terms , 
which  subsume  other  SS-terms  in  P  and  add  to  P  all  SS- 
terms  generated  by  consensus  between  SS-terms  present  in 
P  until  no  new  SS-terms  are  generated.  P  consists  of  just 
the  prime  implicants  of  function  f(X). 

Proof  Similar  to  [3].  □ 


7  Conclusion 

We  have  developed  a  multiple-valued  algebra,  super 
switch  algebra,  and  a  prime  implicant  generation  algorithm 
based  on  it.  This  opens  a  new  approach  to  the  design  of  su¬ 
per  switch  networks,  a  one-pass  algebraic  approach.  Com¬ 
pared  with  the  map-based  procedure  presented  in  [4]  and 
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Function 

n 

Terms 

Time  sec 

deng 

~T 

4 

0.02 

bw 

3 

4 

0.04 

rd53 

3 

2 

0.05 

xor5 

3 

2 

0.01 

5xpl 

4 

2 

0.08 

misexl 

4 

7 

0.09 

rd73 

4 

2 

1.08 

rd84 

4 

1 

0.39 

9sym 

6 

25 

954.00 

sao2 

6 

7 

61.76 

clip 

6 

4 

748.00 

9varl 

9 

5 

0.10 

Table  1.  Computation  time  for  some  benchmark 
functions. 


Terms 

Time  sec 

93 

6 

121 

9 

140 

13 

163 

17 

177 

33 

181 

36 

186 

66 

Table  2.  Minimization  of  9sym  using  differnt  input 
terms. 


subfunction-based  algorithm  presented  in  [3],  our  method 
has  several  advantages: 

•  Due  to  its  algebraic  nature,  it  is  more  suitable  to  the 
synthesis  of  multiple- valued  functions  with  many  vari¬ 
ables. 

•  It  avoids  decomposing  the  function  into  multiple  sub¬ 
functions  and  thus  avoids  producing  the  same  prime 
implicants  in  several  of  the  subfunctions. 

•  The  algorithm  we  proposed  is  easier  to  be  imple¬ 
mented  in  computer  software. 

•  It  is  potentially  more  useful  for  formal  verification  of 
design,  test  generation,  and  design  for  testability 
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Abstract 

After  introducing  the  Method  of  Indeterminate 
Coefficients ,  by  which  we  can  derive  all  finite  models 
satisfying  a  given  set  of  axioms,  we  derive  all  models  of  8 
elements  of  Kleene  algebra  by  using  the  method ,  and  find 
out  many  examples  of  a  set  of  independent  and  complete 
axioms  of  Kleene  algebra  by  checking  whether  each  axiom 
in  Kleene  algebra  is  independent  from  others  or  not  based 
on  the  method ' 

1.  Introduction 

When  fuzzy  sets  theory  was  introduced  by  L.  A. 
Zadeh111  to  treat  numerically  ambiguity  concepts 
concerning  human  thinking,  linguistic  meaning  and  so  on, 
he  used  firstly  three  fundamental  logic  operations 
AND(n),  OR(u)  and  NOT(~)  and  defined  them  as  min, 
max  and  1-  on  the  closed  interval  [0,1],  respectively  Since 
being  introduced  the  logic  operations  AND,  OR  and  NOT 
on  [0,1]  instead  of  two- valued  {0,1},  the  logic  systems 
<[0,l],n,u,~>  was  studied  actively  in  the  beginning 
under  the  name  of  fuzzy  logic.  Although  the  algebraic 
system  corresponding  to  the  fuzzy  logic  <[0,l],n,u,~> 
was  considered  as  De  Morgan  algebra,  in  1981 121  one  of 
the  authors  formulated  firstly  the  fuzzy  logic  system 
<[0,l],n,u,~>  as  Kleene  algebra  which  is  a  stronger 
algebraic  system  than  de  Morgan  algebra.  Kleene  algebra 
is  an  algebraic  system  which  is  a  slightly  weaker  than 
Boolean  algebra,  that  is,  in  Kleene  algebra  the 
complementary  laws  (the  excluded  middle  law 
~  Akj  A  =  \  and  the  contradiction  law  ~  A n  A  =  0 )  do 
not  hold  but  Kleene’s  laws  ~AuA>~AnA  hold, 
where  Kleene's  laws  are  weaker  conditions  than  the 
complementary  laws.  Kleene  algebra  is  a  useful  and 
interesting  algebra  for  treating  ambiguity  or  undefined  or 
contradictory  states,  and  it  is  well  known  that  Kleene 
algebra  corresponds  essentially  to  three-valued  logic.  But 
it  is  not  known  what  kind  of  finite  models  satisfy  the 
Kleene  algebra,  so  it  is  an  interesting  problem  to  find  out 
the  finite  models  satisfying  Kleene  algebra. 

On  the  other  hand  the  Method  of  Indeterminate 
Coefficients  was  developed  by  M.  Goto[3H6]to  determine 


the  truth  tables  of  undefined  operators  included  in  axioms 
of  multiple-valued  logic  systems.  Essentially  it  is 
considered  that  the  Method  of  Indeterminate  Coefficients 
is  a  method  to  find  out  finite  models  satisfying  given 
axioms.  Therefore  the  Method  of  Indeterminate 
Coefficients  is  applicable  to  find  out  finite  models 
satisfying  an  algebraic  system  when  the  axioms  are  given. 
That  is,  it  is  a  strong  tool  to  proof  an  axiom  is  independent 
from  other  axioms  (the  designate  axiom  is  not  derived 
from  other  axioms),  because  we  can  apply  the  method  to 
find  out  a  model,  which  is  a  counter  example,  satisfies  the 
set  of  other  axioms  but  does  not  the  designate  axiom.  In 
this  paper  after  introducing  the  Method  of  Indeterminate 
Coefficients,  we  derive  all  models  of  8  elements  of  Kleene 
algebra  by  using  the  method  ,  and  find  out  many  examples 
of  a  set  of  independent  and  complete  axioms  of  Kleene 
algebra  by  checking  whether  each  axiom  in  Kleene 
algebra  is  independent  from  others  or  not  based  on  the 
method. 

2.  Axioms  of  Kleene  Algebra 


Table  1 .  Representative  equations  valid  in  Kleene  algebra 


(I)  the  commutative  law  (a) 

(1) ’the  commutative  law  (b) 

(2)  the  associative  law  (a) 

(2) ’the  associative  law  (b) 

(3)  the  absorption  law  (a) 

(3) ’the  absorption  law  (b) 

(4)  the  distributive  law  (a) 

(4)  ’the  distributive  law  (b) 

(5)  the  idempotent  law  (a) 

(5) ’the  idempotent  law  (b) 

(6)  De  Morgan’s  law  (a) 

(6) ’De  Morgan’s  law  (b) 

(7)  the  double  negation  law 

(8)  the  least  element  (a) 

(8) ’the  least  element  (b) 

(9)  the  greatest  element  (a) 

(9)  ’the  greatest  element  (b) 

(10)  the  Kleene’s  law  (a) 
(10)’the  Kleene’s  law  (b) 


XuY=YuX 
X  r\Y  -YnX 
Xu(YuZ)  =  (XuY)uZ 
Xn(Yr\Z)  =  (XnY)r\Z 
X  u(X  n/)  =  X 
Xn(XuY)  =  X 
Xn(YuZ)  =  (Xr\Y)u(XnZ) 
Xu(YnZ)  =  (XuY)n(XvZ) 
XuX  =  X 
Xr\X  =  X 
~(XuY)=~  Xn~Y 
~(Xr\Y)=~  Xu~Y 

QuX  =  X 
0nX  =  0 
\uX  =  \ 

\nX  =  X 

(Xu  ~X)u(Yu~Y)  =  Yu~Y 
(Xn  ~X)n(Yu~Y)  =  Xn~X 


An  algebraic  system  <K,n,u,~,l,0>  is  Kleene 
algebra  when  a  set  K  has  at  least  two  elements  1  and  0 
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and  the  operators  n,  u  and  ~  satisfy  all  equations  of 
Table  1  for  any  elements  X,YfZ  of  K.  We  call  these 
equations  as  axioms  of  Kleene  algebra[2[,[7],  but  these 
equations  are  not  independent  each  other. 

3.  The  Method  of  Indeterminate  Coefficients 

3.1.  Truth- Values  and  Symbols 

In  this  paper,  we  use  the  same  following  definitions  as  in  the 
preceding  papers[2H6]. 

3.1.1.  Truth-Values 

X,  Y,  Z  are  logical  variables,  which  can  take  any 
elements  on  a  set  of  K  of  truth  values,  and  also  take  the 
truth  value  of  “Greatest”(‘Truth”)  1  or  “Least”(wFalse”)  0 
in  case  of  two-valued  logical  algebraic  system.  Generally, 
in  case  of  n0-valued  logical  algebraic  system,  X,Y,Z  take 
one  of  the  sets  of  values  K={l,2,--%  n0}  where  1,  n0are 
designated  as  “Greatest”(“Truth”),  “Least”(“False”)  and 
others  are  as  “Intermediate”,  respectively. 

“ X  takes  the  value  jc”  is  represented  by 
X=x  (3.1. 1.1) 

and  simply  by  the  two- valued  expression  A*.  Then 
Xx  =  (X  =  x)  (3.1. 1.2) 

, where  symbol  s  is  referred  to  in  the  next  (1)  in 
Section  3.1.2. 

It  means  that  the  truth-value  of  (3. 1.1.2)  is  l(Truth)  if  X 
takes  the  value  x  and  it  is  O(False)  if  X  doesn’t  take  the 
value  x.  And  then,  the  following  equations  are  obtained. 


about  two- valued  logic,  X]  vX°  =1 , 
where  X 1  -X°  =0  (3.1. 1.3) 

about  three-valued  logic,  X1  v  X2  v  X3  s  1 , 
where  X]  -X2  =  X' -X3  m X2  X*  s0  (3.1. 1.4) 
about  n0- valued  logic  X1  v X2  v  X"0  =1, 
whereA"1  *X}  =0  for  Vi,j;i*j  (3.1. 1.5) 


Of  course,  the  values  1  and  0  in  the  right  side  of 
s=  mean  Truth  and  False  of  two-valued  logic,  where  the 
symbols  v,*  means  classical  two- valued  logic  operators 
which  is  defined  in  the  next  (1)  in  Section  3.1.2. 

3.1.2  Symbols 

Operators  which  are  used  for  discussion  (in  a  meta¬ 
logic)  are  of  course  for  two-valued  logic  and  operators 
which  are  under  research  (in  an  object-logic)  are  for 
multiple-valued  logic  in  this  case  and  these  two  operators 
must  be  clearly  distinguished.  That  is,  in  case  both  logic 
are  treated  together,  each  operator  for  both  logic  is 
described  as  follows. 

(1)  Operators  for  Classical  Two- valued  Logic 

Ordinarily  used  operators  for  classical  two- valued  logic 


are  as  shown  in  the  followings,  where  X,Y  €  {0,1} . 
Negation  X  =  1  -  X , 

Disjunction  X  v  Y  =  max(A",  Y) 

Conjunction  X  a  Y  or  X  •  Y  (if  not  confused,  it  can  be 
omitted)  =  min^,  Y) , 

Implication  X  Y  =  X  v  Y , 

Equivalence  X  <->  Y  =  (X  Y)-(Y  X) , 

Identical  Equivalence  X  s  Y ,  which  means  that  “X 
and  Y  are  equivalent  for  every  value  of  the  set”. 

Equal  X=Y,  an  operator  which  represents  equation  of 
algebra. 

(2)  Operators  for  Multiple- valued  Logic 

n,  u  and  ~  are  operators  on  multiple- valued  logic 
and,  those  are  treated  as  undefined  or  unknown  operators 
at  first  and  the  truth  tables  will  be  determined  through  the 
Method  of  Indeterminate  coefficients. 

3.2.The  Method  of  Indeterminate  Coefficients 

3.2.1. Representation  of  Undefined  Operators 

n ,  u  are  defined  as  two  variable  operators  and  ~  is  as 
a  single  variable  operator.  Assuming  that  the  formers  are 
the  unknown  logical  functions  for  two  variables  and  the 
latter  is  the  unknown  logical  function  for  one  variable, 
names  of  functions  are  represented  as  DIS,  CON,  NEG.  In 
this  case,  using  the  indeterminate  coefficients,  SJmn ,  CJmn , 
NJm  which  take  two- value  0  or  1,  those  functions  are 
expressed  as  follows. 

(Xu  Yy=DIS(Xjy=ZZSi„-X”'Y".  (3.2.1. 1) 

m=l  /i=) 

(XnYy=CON(X,Yy=  V  vCV  •A'"  /”.  (3.2.1.2) 

m=l  «=1 

(~  X)J  =  NEG(Xy  =  v  Ni  ■  X”  .  (3 .2. 1 .3) 

m=I 

,  where  Sf„n  is  1  if  DIS( m,  n)=j  is  true  and  0  if  DIS(m, 
n)=j  is  false.  It  is  similar  for  CJmn  and  NJm  ,  respectively. 

(Example) 
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The  three- valued  logical  function  F(X,Y)  of  two 
variables  in  Table  2(a)  is  represented  by  F(X,  Y)},  F(X}  Y)2 
and  F(XfY)3  in  Table  2(b)^(d),  respectively.  Only  0  and  1 
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appear  in  those  tables.  In  this  case,  for  example,  the 
following  three  equations  are  obtained. 

F(X,Y)'  =  X'Y3vX3-Y3vX3-Y3 
F(X,Y)'-  =  X'  Y2  v  X- -Y'  v  X3  -Y3  (3.2.1.4) 

F(X,Y)3  =  X'  •  Y'  v  X3  -r  v  X3  ■  Y3 
And  then,  the  following  two  conditions  are  concluded 
in  case  we  use  F3„  as  the  value  of  X’n  ■  Y"  in  F(X ,  Y)J 
(1  <  y 3) . 

Fl  vFm2„  vFL  =1, 

F'  ■ F 2  =F2  F3  =  F'„  •  F3 =  0,  (1  <  m,n  <  3) 

1  mn  J  mn  *  mn  mn  mn  mn  5  v  9  f 

(3.2.1.5) 

Generally,  for  example,  when  the  three-valued  logical 
function  F(X,Y)  with  two  unknowns  is  represented  by 
Table  2(e)  where  Fm„  e  {1,2,3}  then  the  following 
expression  is  obtained. 

F(X,Y)J  =F,{  ■ X 1  • Y 1  vFi{  • X '  ■ Y 2  v— vF,}  Jf3  T3 
a =u,3) 

(3.2. 1.6) 

,  where  Fm;„  takes  only  one  of  the  values  0  or  1,  and 
v  Fin  ~  1>  F>n-F*n=Hj*k)  (3.2. 1.7) 

That  is,  when  F^l?  is  an  unknown  logical  function 
and  the  value  of  FJ„  will  be  determined  as  0  or  1 ,  F(X,  Y) 
will  be  determined.  The  Method  of  Indeterminate 
Coefficient  is  a  method  which  determinates  the  value  of 
FJ,  on  condition  such  that  a  given  set  of  equations  is 
satisfied.  In  this  case,  for  example,  if 
F'  =F*  =0  F  =Fl  =0  is  determined,  then 
F^n  =  1 ,  or  if  F„n  =  1  is  determined,  then 
FL=Fi=  0. 

3.2.2  Expression  of  the  Axioms  of  Kleene  Algebra  using 
the  Method  of  Indeterminate  Coefficients 

Assuming  that  each  equation  of  Table  1  is  treated  as 
tautology,  equation  of  Table  1,  (1)  the  commutative  law(a) 
is,  for  example,  expressed  as  follows. 

[iuf=yui]'Ei 

From  this  equation,  clearly,  X  vjY  and  Y  u  X  take 
the  same  value  j  for  any  truth  value  of  X  and  Y.  Then, 
DIS(X,Y)i  and  DIS(Y,X)i  are  satisfied  simultaneously, 
where  takes  any  value  of  the  values,  1,2,*”  ,  n0.  We 
obtain; 

3je{l,2,-,n0\;DlS(X,Yy  -DJS(Y,Xy  =1.  (3.2.2.2) 

,  where  takes  only  one  value  of  {1,2, ...n0}, 
simultaneously,  then  the  equation  (3.2.2.2)  is  expressed  as 
follows. 

y  DIS{XJ)J  •  DIS{Y,X)J  =1 
/but  vi  *  j\DlS‘  ■  D1SJ  =  0  (3. 2.2.3) 


Replacing  DIS  with  SJm  ((3.2.1 . 1))  in  above  equation  , 
the  following  equation  can  be  obtained. 

v  S£y  -S^  -Xx  -y  =  1,  (3.2.2.4) 

fhis  equation  must  always  be  satisfied  when  logical 
variables  X  and  Y  take  any  truth  values  x  and  y.  And  then, 
the  equation  (3.2.2.4)  is  transformed  into  the  equations 
(3.2.2.5)  which  is  equivalent  to  (3.2.2.6). 

HSj'.Si-X'.YynX,  (3.12.5) 

^AV  Si-Si- 1-  (3-2-2-6) 

Therefore,  clearly,  the  truth  table  about  U ,  which 
satisfies  the  equation  (1)  of  Table  1  ,  can  be  determined 
from  the  undefined  two-valued  coefficients  by 
solving  the  equation  (3.2.2.6). 

Simultaneously,  all  equations  of  Table  1  are 
transformed  into  equations  of  Table  3  using  the  undefined 
two- valued  coefficients  C’„4,  NJm  in  the  equations 

(3.2.1. 1)~(3.2.1.3),  where  x,  y,  z  mean  the  value  of 
independent  variables  X,  Y,  Z  and  j,  k,  1,  m  mean  those  of 
terms  DIS,  CON,  NEG. 

3.3  Algorithm  for  the  Method  of  Indeterminate 
Coefficients 

In  this  algorithm,  at  first  we  put  “2”,  which  means 
indeterminate,  into  all  coefficients  SJm,Ci„  and  N3,. 
Next,  we  determinate  the  indeterminate  coefficients  “2”  to 
determinate  value  “0”  or  “1”  in  order  to  satisfy  each 
equation,  one  after  another.  That  is,  at  first,  in  the  no- 
valued  logic,  we  determinate  the  coefficients  in  order  to 
satisfy  first  equation  about  all  sets  of  values  for  (x,  y,  z); 
from  (1,1,1)  to  (n0,  n0,  n0)  in  case  of  three  variables,  where 
some  coefficients  are  not  determined,  i.e.  those  values  are 
still  “2”.  And  then,  we  determinate  those  indeterminate 
coefficients,  which  are  “2”,  by  next  equation.  In  this  time, 
when  a  solution  of  the  new  equation  shows  a  coefficient 
“1(0)”,  which  was  determined  to  “0(1)”  in  the  previous 
equations,  such  a  solution  should  be  cancelled  as  a 
contradiction. 

Similarly  the  above  process  should  be  repeated  for 
every  set  of  values  for  x,  y,  z  about  all  equation  of  a  set  of 
axioms.  Finally,  any  solutions  obtained  during  such 
process  satisfy  all  equation  of  the  axioms,  which  are 
models  satisfying  the  given  all  axioms.  Let  us  call  the 
solutions  during  the  process  “Provisional  solutions”.  Such 
algorithm  is  described  in  detail  as  follows. 

3.3.1.The  Program  Based  on  Wide-spread  Algorithm 

We  apply  a  wide-spread  algorithm  to  each  equation, 
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one  after  another,  in  order  to  determinate  the  coefficients 
by  solving  the  simultaneous  logical  equations. 

The  procedure  to  apply  such  algorithm  is  as  follows. 

Procedure  1.  Solving  the  first  logical  eqaution. 

Procedure  1-1.  We  will  get  all  “Particular  solutions”  which 

satisfy  the  first  equationiQr..(^  .y<  zHL  1.  IX 

Procedure  1-2.  We  will  get  new  particular  solutions  about 

the  first  equation  for  next  values  of  (xr  yt  7)  and  get  new 

provisional-solutions. 


This  process  is  repeated  from  (x,  y,  z)=(l,  1,  2)  to  (n0, 
no,  no). _ 


Table  3.  Equation  of  Table  1  transformed  by  undefined  coefficients 

(1)  A 

x.y-l  j-\  v  * 

(3.2.17(1)) 

(1)’  A  vC' -C'  ■  1 

(3 .2.2. 7(1)’) 

(2)  A  0  s' -s* -s^-s;- 1 

(3.2.2. 7(2)) 

(3.2.2. 7(2)’) 

(3.2.2.7(3)) 

(3)’  A  vC;-S>.  =  ! 

v  7  Jr.j-1  7-1  *J  } 

(3.2.2. 7(3)’) 

(4)  A  C  Ci-Sl-Si-C-CSs  1 

(3.2.2. 7(4)) 

(3.2.2.7(4)*) 

(5)  a  =  1 

(3.2.2.7(5)) 

(5)’aC'.1 

(3.2.2.7(5)’) 

<«>  LLn‘>-s»-c*n'n^1 

(3.2.2.7(6)) 

/*— ■ s 

o\ 

i> » 
!<• 

.0 

£ 

ja: 

III 

(3.2.2.7(6)’) 

(7)  avV;-V/s1 

(3.2.2. 7(7)) 

(8)  aS^.1 

(3.2.2.7(8)) 

(8)'aC;,| 

(32.2.7(8)’) 

(9)  aS!,*1 

(3.2.2. 7(9)) 

(9)’aQ  =  1 

(3.2.2. 7(9)’) 

(10)  A  V  SlrC‘rN',-S{.-N;  =  \ 

(3.2.2.7(10)) 

(10)’  a  C  CL-Q-Ni-SL -N’el 

(3.2.2.7(10)’) 

second  equation  of  Table  3  to  repeat  the  procedure  1-1. 
Those  solutions  become  the  general  solutions  about  two 
equations  to  be  treated. 

Procedure  ..3^e.-will.geLgeneral  solutions  about  the  .scurf 

axioms  of  Kleene  algebra . 

Repeating  above  the  process  to  solve  all  equations  of 
Table  3,  we  will  get  all  solutions  about  the  set  of  axioms 
of  Kleene  algebra,  where  each  solution  corresponds  to  a 
model  of  kleene  algebra. 

3.3.2  The  Program  Based  on  Depth-first  Algorithm 

In  the  program  based  on  a  depth-first  algorithm,  at  first, 
only  one  solution  which  satisfy  all  equations  of  the  set  of 
axioms  is  derived.  For  that  purpose,  the  coefficients  S ,  C 
and  N  in  each  equation  of  the  set  of  axioms  must  be  solved 
in  order  to  satisfy  each  equation  for  every  set  of  values 
about  x,y,z  and  some  set  of  values  about  j,k,l,m.  By 
storing  the  each  set  of  values  about  j,k,l,m  for  all  sets  of 
values  about  x,y,z  in  each  equation  of  an  set  of  axioms, 
another  solutions  except  the  first  one  is  derived  efficiently. 
This  method  have  the  following  two  merits. 

(1)  About  the  size  of  the  program 

The  size  of  this  program  based  on  this  algorithm  is 
smaller  than  that  based  on  wide-spread  algorithm,  because 
only  one  provisional  solution  is  treated  always.  And  it  is 
not  necessary  accessing  any  file  on  the  disk  memory. 
Therefore,  in  the  many-valued  algebraic  system  which 
can’t  be  treated  by  the  program  based  on  wide-spread 
algorithm  for  limitation  of  the  quantity  of  the  disk  memory, 
we  can  derive  all  solutions  without  considering  the  size  of 
program  and  the  disk  memory. 

(2)  About  deriving  the  solutions 

If  only  one  solution  is  needed,  we  can  derive  more  fast 
by  this  program  than  by  the  program  based  on  wide-spread 
method. 

For  example,  we  can  obtain  all  models  of  a  Kleene 
algebra  having  eight  elements  by  using  the  program  on 
Depth-first  Algorithm.  The  Figure  2  shows  all  that  models. 
In  the  next  section  we  will  show  an  application  of  this 
program  of  wide-spread  method. 


Procedure  2.  Solving  the  second  logical  equation. 

And  then,  we  will  get  provisional  solutions  about  the 
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Figure  1 .  The  Models  of  Kleene  Algebra,  n=8 

4.  Independent  and  Complete  Axioms  for 
Kleene  Algebra 

4.1.  The  Independence  of  Each  Axioms  for  Kleene 
Algebra 

The  independence  of  each  axiom  for  an  algebraic 
system  can  be  examined  by  applying  the  Method  of 
Indeterminate  Coefficients,  in  which  the  program  is  based 
on  wide-spread  method,  to  the  equations  of  its  set  of 
axioms.  For  example,  if  we  can  find  a  new  finite  model 
of  Kleene  algebra,  during  the  treatment  of  the  last  axiom, 
the  independence  of  the  last  axiom  from  the  set  of 
previous  axioms  is  proved  automatically  because  the 
derived  new  model  is  a  counter  example  showing  the  last 
axiom  is  independent  from  the  set  of  previous  axioms,  and 
the  last  axiom  become  a  candidate  for  complete  axioms. 
Therefore,  if  we  would  like  to  decide  whether  an  axiom  is 
independent  or  not,  it  is  sufficient  to  treat  its  equation  at 
last  in  a  set  of  axioms  in  the  program  of  wide-spread 
method.  For  example,  Table  4-8  show  counter  examples 
showing  each  axiom  is  independent  from  set  of  other 
axioms,  respectively.  Thai,  it  was  proved  that  cme  of  the 
commutative  law  (a)  or  (b),  (me  of  the  distributive  law  (a) 
or  (b),  one  of  the  De  Morgan’s  law  (a)  or  (b),  the  double 
negation  law,  and  one  of  the  Kleene’s  law  (a)  or  (b)  are 
independent  from  others,  respectively. 

Table  4:  An  example  which  indicates  that  the  commutative  law  is 
independent. 

1  2  3  4  I  1  2  3  4  X|  -X 
11111  11234  1  4 

21222  21234  2  3 

31234  33333  3  2 

41234  44444  4  1 

Xu Y  XnY 


Table  5:  An  example  which  indicates  that  the  distributive  law  is 
independent 

1  2  3  4  5  I  1  2  3  4  5  X|  ~X 
111111  1  1  2345  1  5 

212112  222555  2  4 

311313  335355  3  3 

411144  445545  4  2 

512345  555555  5 

Xu Y  XnY 

Table  6.  An  example  which  indicates  that  the  De  Morgan’s  law  is 
independent. 

12  3;4  I  1  2  3  4  X|  -X 
11111  11234  1  2 

21232  22232  2  1 

31333  33234  3  4 

41234  44444  4  3 

Xu Y  XnY 

Table  7.  An  example  which  indicates  that  the  double  negation 
law  is  independent. 

I  2  3  1 1  2  3  x|  ~x 

1111  112  3  II 

2122  2223  2  1 

3  1  2  3  3  3  3  3  3  1 

Xu  Y  XnY 


Table  8.  An  example  which  indicates  that  the  Kleene's  law  is 
independent. 

12  3  4  I  1  2  3  4  x|  -X 
111)1  11234  1  4 

21212  22244  2  2 

31133  33434  3  3 

41234  44444  4 

Xu Y  XnY 

In  case  of  finite  Kleene  algebra,  1  and  0  are  the 
greatest  element  and  the  least  element  by  expressing  the 
following  equations,  respectively. 

1  =  X,  u  X1  u  ■  ■  •  u  X,  ,0  =  X,  n  X:  n  •  •  •  n  X„ 
for  X, ,  X 2 ,  •  •  • ,  Xn  e  the  set  of  all  elements  i 

of  finite  Kleene  algebra  j 


Figure  2. 

Clearly,  in  case  of  finite  Kleene  algebra,  the  least 
element  and  the  greatest  element  are  not  independent.  But, 
in  case  of  infinite  Kleene  algebra,  for  example,  a  model 
expressed  in  Figure  2  satisfies  all  axioms  except  the  least 
element  and  the  greatest  element.  Then,  at  least,  one  of  the 
least  element  (a)  or  (b)  or  the  greatest  element  (a)  or  (b)  is 
independent  from  another  axioms  in  Table  1. 

The  associative  law  (a)  and  (b),  the  absorption  law  (a) 
and  (b),  and  idempotent  law  (a)  and  (b)  are  derived  from 
another  axioms  in  Table  1P).  Then,  those  axioms  are  not 
independent. 


4.2.  Independent  and  Complete  Axioms  for  a 
Kleene  Algebra 
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We  can  find  all  independent  axioms  in  Table  1  by 
applying  the  Method  of  Indeterminate  Coefficients  (Table 
9).  They  are  candidates  for  independent  and  complete 
axioms  of  Kleene  algebra.  Here,  we  try  to  derive  examples 
of  independent  and  complete  axioms  of  Kleene  algebra 
from  the  set  of  candidate  axioms. 

All  sets  of  axioms  included  one  of  the  least  law  (b)  or 
the  greatest  law  (a)  but  not  included  the  least  law  (a)  and 
the  greatest  law  (b)  are  not  complete.  For  example,  a 
operator  table  (Table  10)  satisfies  the  set  of  axioms  in 
Table  11,  but  is  not  a  model  of  the  Kleene  algebra. 
Therefore,  the  set  of  axioms  in  Table  1 1  is  not  complete. 

If  a  candidate  of  independent  and  complete  set  of 
axioms  includes  the  least  element  (a),  then  the  greatest 
element  (b)  can  be  derived  from  the  definition  ~1=0,  the 
De  Morgan’s  law  (b)  and  the  double  negation  law. 

Table  9.lndependent  axioms  in  table  1 

{ 1 }  one  of  the  the  commutative  law  (a)  or  the  commutative  law  (b) 

{2}  one  of  the  the  distributive  law  (a)  or  the  distributive  law  (b) 

{3}one  of  the  De  Morgan’s  law  (a)  or  De  Morgan’s  law  (b) 

{4} the  double  negation  law 

{5}one  of  the  the  least  element  (a)  or  the  greatest  element  (b) 

{6}one  of  the  the  Kleene’s  law  (a)  or  the  Kleene’s  law  (b) _ 


Table  12.  A  independent  and  complete  set  of  axioms  of  Kleene 
algebra121 


{ 1 }  the  commutative  law  (a)  X  U  Y  =  Y  U  X 

{2}  the  distributive  law  (a)  X  Ci(Y  u Z)  =  (X  C\Y)U(X  nZ) 

{3}  De  Morgan’s  law  (a)  ~  (X  u  Y)  =~  In  ~  Y 

{4}  the  double  negation  law  ~  (~  X)  =  X 

{5}  the  least  element  (a)  0  U  X  =  X 

{6}  the  Kleene’s  law  (a)  (Xu  ~  X)  u(Ku~  7)  =  7u~  Y 


5.  Conclusion 

It  is  shown  that  the  Method  of  Indeterminate 
Coefficients  is  very  useful  to  find  out  finite  models,  and 
therefore  to  find  independent  and  complete  sets  of  axioms. 
Using  a  set  of  axioms  of  Kleene  algebra  as  an  example, 
we  can  show  32  independent  and  complete  sets  of  axioms 
of  Kleene  algebra  in  total  by  the  Method  of  Indeterminate 
Coefficients. 

In  the  near  future,  we  would  like  to  apply  this  method 
to  clarification  of  the  fundamental  properties  of  such 
another  algebra. 


Table  10.  An  example  of  an  operator  table,  which  satisfies  a  set 
of  axioms  in  Table  11. 


_X 

~X 

1 

1 

3 

3 

1 

1 

3 

3 

1 

2 

2 

3 

2 

3 

2 

3 

2 

3 

2 

1 

3 

3 

3 

3 

3 

3 

3 

3 

3 

3 

XuY  XnY 

Similarly,  the  greatest  elsement(b)  can  be  derived  from 
the  definition  ~0=1,  the  De  Morgan’s  law  and  the  double 
negation  law.  Then,  only  one  of  the  least  element  (a)  or  the 
greatest  element  (b)  is  independent.  From  the  results  4.1, 
all  sets  of  axioms  in  Table  9  are  independent  from  another 
axioms  in  Table  1.  Therefore  all  independent  and  complete 
sets  of  axioms  for  Kleene  algebra  must  include  those  six 
axioms  in  Table  9.  In  fact,  it  was  proved121  that  the  set  of 
axioms  of  Table  12  is  an  independent  and  complete  set  of 
axioms  of  Kleene  algebra.  It  means  that  all  axioms  in 
Table  1  can  be  derived  from  the  set  of  axioms  in  Table  12. 
In  fact,  by  using  the  results  of  reference  (2),  although  here 
the  proofe  are  omitted  because  of  the  space  limit,  we  can 
prove  that  the  32  sets  of  axioms  composed  from  one  of  the 
axioms  1-6  in  Table  9  are  independent  and  complete  sets 
of  axioms  of  the  Kleene  algebra. 

Table  11 .  Aset  of  axioms  not  satisfied  Kleene  algebra 

{ 1 }  the  commutative  law  (a)  Xu  Y  -Yu  X 

{2}  the  distributive  law  (a)  X  n  (Y  U  Z)  =  ( X  n  Y)  u  ( X  n  Z) 

{3}  De  Morgan’s  law  (a)  ~  (X  u  T)  =~  Xn  ~  Y 

{4}  the  double  negation  law  ~  (~  X)  =  X 

{5}  the  least  element  (b)  0  n  X  -  0 

{6}  the  Kleene’s  law  (a) _ (Xu  ~  Jf)u(Ku  ~  Y)  =  Yu  ~  Y 
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Abstract 

Cascade  function  is  a  Boolean  function  which  can 
be  implemented  by  a  so-called  cascade  network.  An  n 
input  cascade  network  is  a  circuit  built  with  n  —  1  two- 
input- one- output  gates  (i.e.,  dyadic  operations)  such 
that  at  least  one  input  of  each  gate  is  a  network  in¬ 
put.  By  arranging  the  inputs  in  a  proper  order  these 
networks  can  be  presented  in  a  “cascade”  shape ,  which 
is  the  origin  of  the  name. 

Cascade  networks,  and  thus  cascade  functions  have 
many  interesting  properties.  Although  the  portion  of 
cascades  among  all  logic  functions  is  small  (the  ratio 
approaches  zero  with  growth  of  the  number  of  vari¬ 
ables),  their  remarkable  properties  and  the  practical 
significance  of  such  networks  has  attracted  many  re¬ 
searchers  in  the  fields  of  switching  functions  and  logic 
design. 

There  are  several  papers  published  that  focus  on 
the  enumeration  problem  of  cascade  functions  and  net¬ 
works.  Asymptotic  expression  and  a  recurrence  relation 
have  been  found  for  the  number  of  all  cascades  ([12]), 
as  well  as  for  some  subclasses  (see  e.g.  [13,  15]).  How¬ 
ever,  to  the  author's  knowledge ,  until  now  no  closed 
formula  has  been  discovered  which  explicitly  counts 
such  functions. 

This  paper  presents  an  explicit  formula  for  the  num¬ 
ber  of  all  n-variable  cascade  functions. 


1.  Preliminaries 

Let  B  =  {0, 1}.  An  n-variable  Boolean  function  is  a 
map 

f(x :  B  — +  Bn. 

Denote  by  fi(n)  the  set  of  all  n-variable  Boolean  func¬ 
tions.  Put  fl  =  Ya^=i  ^(n)- 


The  variable  xl  of  the  function  f(x\, . . . ,  xn) 
is  called  essential ,  if  there  exist  values 

ali  •  *  •  >  an— 1?  an+l>  •  •  •  > 

f{p>  1>  *  •  •  ?  Gn~i,  0,  ftn+1?  •  *  •  »  an)  7^ 

fig  1>  •  •  *  5  1?  1)  G’n-f  ■  *  •  > 

Function  /  G  fi(n)  is  called  essential  function  if  all  its 
variables  are  essential.  By  fte(n)  we  denote  the  set  of 
all  n-variable  essential  functions. 

The  number  E{n)  =  |f2e(n)|  of  all  n-variable  essen¬ 
tial  functions  are  easily  computed  by  - 

i=o  '  ' 

The  notion  of  formula  over  the  subset  A  C  ft  is  in¬ 
ductively  defined  by  superposition  of  functions  from  A , 
as  well  as  the  operations  of  identification  and  permu¬ 
tation  of  the  variables.  Any  functions  /  €  Q(n)  can  be 
expressed  by  a  formula  over  the  set  f2(2),  which  means 
the  latter  is  complete  in  Ct. 

Any  function  f(x\, . . .  ,xn)  G  Q(n)  has  infinitely 
many  formal  expressions,  also  called  decompositions 
over  any  complete  domain  A.  An  important  prob¬ 
lem  for  logic  functions  is  to  find  an  optimal  expres¬ 
sion  over  a  given  domain.  In  classical  Logic  Design 

2)  and  its  certain  complete  subsets  are  most  widely 
chosen  domains  of  basic  operations,  i.e.,  considering 
combinational  circuits  that  are  built  using  two-input- 
one-output  gates. 

The  general  problem  of  optimal  decomposition  for 
arbitrary  function  is  known  to  be  hard.  However,  the 
study  of  certain  classes  of  “easily  decomposable”  func¬ 
tions  is  important  for  a  better  understanding  of  the 
general  optimization  problem.  Also,  networks  and  cir¬ 
cuits  that  realize  these  functions  have  remarkable  prop¬ 
erties,  such  as  simplicity  of  synthesis,  reliability,  facility 
of  fault  detection  etc.,  which  makes  their  study  signif¬ 
icant  from  a  practical  point  of  view. 
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Xin  O 


Figure  1.  Cascade  circuit 


Among  “most  decomposable”  functions,  those  which 
can  be  formally  expressed  through  the  least  possi¬ 
ble  number  of  basic  operations  are  naturally  included. 
Considering  n- variable  essential  functions  and  0(2)  as 
the  domain  of  operations  this  minimal  number  of  gates 
is  n  —  1.  Such  functions,  i.e.,  those  realizable  by  n- 
input  (n  —  l)-gate  combinational  circuits  are  called  or- 
tholinear  [17],  and  such  circuits  are  called  disjunctive 
networks  [12]. 

The  focus  of  our  study  here  is  a  subclass  of  ortholin- 
ear  functions,  which  can  be  realized  by  so  called  “cas¬ 
cades”  (see  Figure  1). 

Definition  1  Function  f(x i, . . .  ,xn)  6  Oe(n)  is  called 
n- cascade,  or  simply  cascade  if  there  is  a  permutation 
7 r  =  (ii, . . . ,  in)  and  n  -  1  operations  <^i, . . . , <pn-i  € 
0(2)  ,  such  that 

f(x  i, . . . ,  Xn)  =  ipn-l(xin,  » <Pn- 3  •  •  * 

.  .  .  <Pl(Xi2,  Xfi))  ••.))■ 

Let  T(n)  be  the  set  of  all  cascade  functions  in  Oe(n). 
Put  C(n)  :=  |T(n)|  -  the  number  of  all  n-cascades. 

The  properties  of  cascades  have  been  studied  for 
many  years  and  by  many  researchers  ([1]  -  [13]).  Sev¬ 
eral  papers  (e.g.  [3,  6,  9,  11,  12,  13])  have  focused  on 
the  number  of  all  cascades  and/or  their  special  types. 
Particularly,  J.T.Butler  [12]  established  a  recurrence 
relation 

C(n)  =  ]T  (-l)i+1  Q  (2<+1  +  1  )C(n  -  i)  ^ 

-  (— l)n(2n  +  l)C(l), 
as  well  as  an  asymptotic  approximation  - 
C(n)  ~  n\anb  for  large  n, 


where  a  =  4.04095  •  •  • ,  b  =  0.28790 

Expression  (2)  renders  a  simple  algorithm  for  com¬ 
putation  of  the  number  C(n).  This  algorithm  was  im¬ 
plemented  in  a  computer  program  and  the  list  of  first 
15  numbers  for  C(n)  appeared  in  [12]. 

In  spite  of  the  substantial  progress  so  far  in  un¬ 
derstanding  and  enumerating  cascade  functions  there 
has  been  no  explicit  formula  discovered  for  the  number 
C(n).  The  main  goal  of  this  paper  is  to  prove  such  a 
formula,  and  thus  to  explicitly  solve  the  enumeration 
problem  for  this  class  of  logic  functions. 

Before  proceeding  to  the  main  result  presented  in 
the  next  section,  a  few  more  definitions  are  needed. 

Definition  2  Function  f(x i, . . .  ,xn)  €  fie(n)  is  called 
(n,k)-cascade,  1  <  k  <  n,  if  there  exists  a  subset  of  the 
variable  set  X1  =  £  X  —  {xi , . . . ,  xn} 

and  a  subfunction  g(X  \  Xf),  such  that 

f(xi,  ...,Xn)  =  Vk(Xik ,  <fik- 1  («<*_!  ,  •  •  •  V\(xh 
for  some  operations  ipi, . . . ,  <pk  €  f2(2). 

Note,  that  the  position  of  g  is  significant,  i.e.,  not  ev¬ 
ery  k  4*  1-cascade  function  h(g ,  x^ , . . . ,  xik )  is  an  (n,  fc)- 
cascade. 

Note  also,  that  for  the  sake  of  formal  consistency  of 
previous  definitions  a  cascade  function  could  have  been 
defined  in  a  more  general  way,  to  be 

f{x  1, .  •  *  ,#n)  ”  tyniXin  >  tyn— 1  (•  •  •  ^2(^2  >  ^l(^ii)  •••))> 

for  a  monadic  operation  < p\  £  tt(l)  and  some  n  —  1 
dyadic  operations  <p2> . .  • ,  <fn  £  ^(2).  It  is  easy  to  see 
though  that  the  two  definitions  produce  the  same  class 
of  functions,  as  the  existence  of  <£2(^2  >  V?i  (^ii ))  induces 
the  existence  of  and  vice  versa. 
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Clearly,  an  (n,  n)-cascade  is  simply  a  cascade  func¬ 
tion.  Moreover,  since  all  one-  and  two- variable  essential 
functions  are  cascades,  any  (n,n  —  1)-  or  (n,n  —  2)- 
cascade  is  also  a  cascade. 

Another  extreme  case,  when  k  =  1,  defines  an  im¬ 
portant  type  of  functions  -  (n,  l)-cascades.  These  func¬ 
tions  can  be  expressed  as  follows 

f(x  1,  ...,xn)  =  g(xh>. .  ^Xin^)  O  xin, 

for  a  dyadic  operation  o. 

By  C\{n)  we  denote  the  number  of  all  (n,  1)- 
cascades. 

Definition  3  For  any  function  g(X)  E  f)(n)  and  any 
variable  setY  =  {t/i, . . . ,  ym}  :  Y HX  =  0  a  (n+ra,  m)- 
cascade  f  - 

/(*i,  •  •  •  ,xn)  =  h(g,  yi, . .  .,ym),  h  6  T(m  + 1) 
is  called  an  m-cascade  extension  of  g. 

2.  The  Number  of  Cascade  Functions 

Theorem  1  The  number  C{n)  of  n-variable  cascade 
functions  n>  2  holds 

C(n)  —  Ci(n) 

~  ]C  -m)-C1(n-m)) (™)c{m),  ^ 


subfunction  is  not  an  (n  —  m,  l)-cascade.  The  latter 
sum  is  expressed  by 

£  (E(n  -  m)  -  Ci (n  -  m))  f™) C(m), 

m—1  '  ' 

where  C(n)  is  the  number  of  m-cascade  extensions  for  a 
chosen  non -(n  -  m,  l)-cascade  subfunction.  This  num¬ 
ber  C(n)  is  multiplied  by  the  number  of  choices  (™), 
and  by  the  number  of  subfunctions  E(n  —  m)  -  C\(n  - 
m). 

B.  Ci(n).  This  method  has  been  chosen  particu¬ 
larly  because  of  the  simplicity  of  counting  the  number 
Ci(n).  The  formula  (4)  is  built  using  the  principle  of 
inclusion  and  exclusion.  First  we  take  the  number  of 
all  1-cascade  extensions  of  all  essential  n  —  1-variable 
functions.  This  number  is  5(j)i?(n  -  1).  It  is  easy 
to  see  that  for  each  chosen  variable  there  are  only  5 
ways  of  1-cascade  extensions,  considering  all  10  essen¬ 
tial  dyadic  operations.  Then  the  principle  is  applied 
by  subtracting  and  adding  the  number  of  all  i-cascade 
extensions  of  (n-z)- variable  essential  functions,  count¬ 
ing  only  those  symmetrically  extendable  regarding  all 
i  variables. 

E(n  —  i)  is  multiplied  by  the  number  of  choices 
of  i  variables,  as  well  as  by  the  number  (2t+1  + 1)  of  all 
^-cascade  symmetric  extensions,  i.e., 

f(x i, . . . ,  Xfi)  —  g{x i,  •  •  •  5  Xj i— i)  O  ^(^n— i+l ,  •  •  •  > 


where  E(n )  is  the  number  of  essential  functions  given 

by  (l); 


n—  1 


<?i(n)  =  ]T(-l)<+1 

i=  1 


(2i+1  +  1  )E(n  -  i) 

—  (— l)n(2n+1  +  2); 


where  $(a?n~i+i, . . .  ,#n)  is  a  totally  symmetric  cas¬ 
cade.  There  are  2l  different  ways  to  choose  an  z-tuple 
1 , . . . ,  cin)  so  that 

f{x l)  •  •  •  yXn—i)  ,  dn)  ==  g{p^\'>  •  •  •  ?  «En— <)• 


C(k)  —  fc!  53  (_i)*-(n+-+r*) 

lrH - \-krk=k 

k 


X  (ri-| - +  rfc)!  JJ 


(2i+l  +  i)n 


^  r<!(z!)r4 


(5) 


Proof 

A.  The  Formula.  First  the  general  approach 
of  counting  all  cascade  functions  is  established.  The 
structure  of  the  formula  (3),  clearly  shows  the  method. 
The  number  of  n-cascades  is  equal  to  the  number  of 
all  (n,  1)  cascades  C\{n)  minus  the  sum  (by  m  = 
1 . .  .n  —  1)  of  the  number  of  all  functions  such  that 
each  one  is  an  (n,  m)-cascade,  but  the  corresponding 


If  o  is  a  so-called  “strong  operation”,  i.e.,  one  of  the  fol¬ 
lowing  eight  -  A=AND,  V=OR,  -iA=NAND,  -iV=NOR, 
>,  <,  >,  <,  then  for  all  other  values  of  {xn-i+ i,  •  •  • ,  xn) 
the  function  /  is  a  constant  0  or  1.  This  makes  the 
number  of  choices  at  least  2  •  2%  =  2l+1.  Finally,  there 
is  only  one  (essential)  extension  when  o  is  0=XOR  or 
r>j.  Thus  the  number  of  all  possibilities  is  2W  4- 1. 

Similar  constructions  as  above  are  used  in  [12]  for 
establishing  the  recurrent  expression  (2).  This  explains 
the  resemblance  of  our  formula  for  Ci(n)  with  (2). 

C.  C(k).  This  is  the  core  part  of  the  formula,  since 
by  counting  all  fc-cascade  extensions  we  deal  with  the 
essence  of  cascade  structures. 

The  formula  (5)  is  obtained  as  result  of  a  simplifiea- 
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n 

E(n) 

Ci(n) 

C(n) 

C(n) 

i 

2 

2 

5 

2 

2 

10 

10 

41 

10 

3 

218 

114 

497 

114 

4 

64,594 

3,922 

8033 

1,842 

5 

4,294,642,034 

1,596,666 

162,305 

27,226 

6 

«1.85e20 

128,830,610,650 

3,935,201 

902,570 

7 

«3.4e39 

«6.46e21 

111,313,697 

25,530,658 

8 

«1.158e78 

wl.36e41 

- 

825,345,250 

9 

«1.34ell7 

«5.21e79 

- 

30,016,622,298 

10 

wl.8e311 

«6.7el56 

- 

1,212,957,186,330 

Table  1.  Number  of  n-varlable  cascade  functions. 


tion  of  the  following  expression 

t~  1 

x  Y,  (2°1+1  +  l)ri  .....  (2a<+1  +  l)r‘ 

airiH - hatrt=fe 

_ fc! _ 

X  nKaiDri  • . •  rt!(at!)r*  * 

The  first  sum  indicates  that  we  apply  again  the  inclu¬ 
sion  and  exclusion  method.  The  first  member  of  the 
sum  is  5k  •  fc!,  which  is  the  number  of  all  orderings  of 
fc  variables,  times  the  number  of  fc-cascade  extensions 
with  respect  to  each  order.  This  number  is  just  an  up¬ 
per  bound  for  C(k),  i.e.,  C(fc)  <5 *  •  fc!,  since  there  are 
functions  that  are  counted  more  than  once.  Note  that, 
this  simple  upper  bound  is  quite  close  the  asymptotic 
formula  found  in  [12]  and  cited  in  the  previous  section. 

We  then  subtract  and  add  fc-cascade  extensions 
which  have  2  (minus),  3(plus),  ...  symmetric  variables. 
The  second  sum  is  taken  for  all  t-block  partitions  of 
the  number  fc.  These  partitions  correspond  to  the  par¬ 
titions  of  the  fc  variable  set  induced  by  the  symmetry 
relations.  The  fractional  expression  is  the  number  of 
partitions  which  themselves  add  up  to  the  Bell  num¬ 
ber.  They  are  multiplied  by  the  number  of  extensions 
with  corresponding  t-block  symmetries.  The  expres¬ 
sion  (2a*+1  +  l)ri  represents  r*  copies  of  the  same  a $- 
block,  each  of  which  is  counted  by  using  the  same  ar¬ 
gument  as  the  one  used  in  part  B  of  this  proof.  □ 

3,  Concluding  Remarks 


any  feasible  number  variables.  The  data  of  Table  1 
(computed  with  Maple  V)  shows  the  numerical  val¬ 
ues  for  C(n)  and  for  the  coefficients  involved,  for 
n  sss  1,2, . . . ,  10. 

The  numbers  E(n)  for  n  >  6  and  C\(n)  for  n  >  7 
are  very  large  (>  20  digits),  and  are  thus  presented  in 
the  table  as  approximations.  The  coefficient  C(n)  has 
been  computed  for  up  to  (7(7),  which  is  sufficient  for 
counting  (7(10). 

It  is  important  to  note  that  the  numbers  for  C(n) 
above  are  exactly  the  same  as  those  obtained  by  us¬ 
ing  a  computer  program  implementing  the  recurrence 
expression  in  [12]. 

After  certain  modification  the  method  for  counting 
cascades  shown  in  this  paper  can  be  used  for  estab¬ 
lishing  further  enumeration  results  for  some  classes  of 
logic  functions,  such  as  ortholinear  functions  (disjunc¬ 
tive  networks),  multiple- valued  cascades  and  multiple¬ 
valued  ortholinear  functions.  We  would  like  to  note 
also,  that  after  substitution  of  all  its  formal  subexpres¬ 
sions,  the  main  formula  for  C{n)  can  be  simplified  and 
presented  in  a  compact  form  of  n. 
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Abstract 

In  multiple-valued  logic  theories,  the 
characterization  of  Shejfer[l]  functions  is  an 
important  problem,  it  includes  the  decision  and 

construction  for  Sheffer  functions  in  Pk  and  .  The 
solution  of  these  problems  depends  on  the  solution  of 

the  decision  problem  for  completeness  in  Pk  and  Pk , 
and  reduced  to  determining  the  minimal  coverings  of 

precomplete  sets  in  Pk  and  Pk,  respectively,  this 
paper  studies  some  similarity  among  precomplete 

sets  preserving  m-ary  relation  in  Pk  ■ 

Keywords:  Partial  multiple-valued  logic,  Complete¬ 
ness,  Sheffer  functions. 

I.  Introduction 

Let  Ek  =  {0, 1,  •  -  - ,  A:  —  1},  k  >  2 .  The  total  and 
partial  k  -valued  logic  functions  over  Ek  are  called 
partial  £ -valued  logic  functions.  The  set  of  all 

partial  k  -valued  logic  functions  is  denoted  by  Pk  , 
namely 

Pk  =  -,*„)|  f-Enk  -*•  Ek  u  {*}»  »  *  1} 

where  =*  means  that  /  is 

undefined  at  point  («i, •••,«„)  £ Ek  .  We  denoted 
by  *  the  empty  function  that  is  undefined  at  every 
point. 

A  Function/(xi>"‘>*/»)  is  said  to  be  a 
Sheffer  function  if  any  function  in  Pk  can  be 
composed  by  /(*i  »•••>*«). 

*This  work  is  supported  by  a  grant  from  the  Natural 
Science  Fund  of  Hunan  Province  .China  A  function 


It  follows  from  the  theory  on  completeness  [2] 
that  /  is  a  Sheffer  function  if,  and  only  if,  for  any 

if, 

precomplete  set  A  cz  Pk  there  have  f  £  A .  But  the 

number  of  precomplete  sets  in  Pk  will  raise  rapidly 

with  increment  of  kt  so  it  is  necessary  to  find  a 
simpler  method  for  decision  on  Sheffer  functions. 
Although  any  two  precomplete  sets  A  and  B  in 

Pk  hold  that  Act  B  and  B  <t  A ,  one  precomplete 
set  may  be  included  in  union  of  some  precomplete 
sets.  Thus  when  we  decide  whether  f  is  a  Sheffer 
function  ,  those  precomplete  sets  which  are  included 
in  the  union  of  some  precomplete  sets  need  not  to  be 
checked  whether  f  is  included  in  them.  Therefore 
the  decision  on  Sheffer  functions  is  reduced  to 
determining  the  minimal  coverings  of  precomplete 
sets. 

The  problem  of  determining  the  minimal  covering 
of  precomplete  sets  in  P}  was  solved  in  1991  by  the 
author  [3],  and  for  k>  4 ,  the  author  has  proved  that 
Te  ,  LG)  2  and  Lp  are  the  component  parts  of  the 

minimal  covering  of  precomplete  sets  in  PE  ([4], 
[5],  [6]). 

In  this  paper,  we  shall  study  the  similarity  among 
precomplete  sets  preserving  m-ary  relation  in  partial 
k-valued  logic. 

II.  Some  Definitions 

Let 

&(*!.•••.*.)  eP/,i  =  l,— ,«• 

The  composition 

h{X\  ,Xn')  —  fi_g\  (Xj  >gm  C^i  ”  v^n)) 

is  defined  as  follows: 

For  arbitrary  ct  =  (ai>'">an)  » 
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if  g,(a),”*,gm(a)  are  all  defined,  then 
Ka)  =  f(gl(a),-,gm(a))\  and  if  one  of 
gi(cc),---,gm(a)  is  not  defined,  then 
h(a)  =*  (undefined). 

Let  The  fimction 

f(.g\  0*1  >* ' " »** is  said  to  be 
composed  from  A  if 

gi(xi>”'*xn)(j  =  1» '">m) either  belongs  to  A  or 
is  one  of  the  variables  .c, ,•••*„ ..  The  set  of  all 
functions  composed  from  A  is  denoted  by  A' . 

A(c  Pk  )  is  said  to  be  closed  if  A  -  A’ . 

A  is  said  to  be  complete  if  A  =  Pk ,  where 
A  =  A'v(A')'v((A'yy<J-. 

Let  M(c  Pk  )  be  a  closed  set,  if  there  is  no  other 

closed  set  between  M  and  Pk  ,  then  M  is  said  to 
be  a  precomplete  set  or  maximal  closed  set. 

Let  2  be  the  set  of  all  precomplete  sets  in  Pk  , 
S'cE.  We  denote  by  Sz,  the  set  of  all  functions 
f  eAj,  where  Aj  e  E ' ,  namely, 

Sz,={fe  P; 1 3 Aj((Aj  e  I')  a  (/  e  A,))} 

Set  £'(c  £)  is  said  to  be  the  covering  of  £  if 

sr=sz. 

Covering  S' is  said  to  be  minimal  if  any  proper 
subset  of  S'  is  not  the  covering  of  S  . 

Obviously,  for  a  precomplete  set  Ac  Pk ,  if  there 
exists  a  function  /  e  A  such  that  for  any  other 
precomplete  set  B(c  Pk  ) ,  f  €  B  then  A  must 
appear  in  the  minimal  covering  of  precomplete  sets 
of  Pk ,  so  we  call  A  the  component  part  of  the 
minimal  covering  of  precomplete  sets  of  Pk  . 

Let  0  c  E  c  Ek.  The  function  f(xx ,•  •  •  ,x„ )  is 
said  to  preserve  E  if  /(ctj, •••,«„)  =  *  or 
/(a1,---,an)’  belongs  to  E  for  arbitrary 
a\, •••,<!„  bE  .  The  set  of  all  functions  preserving 
E  is  denoted  by  TE. 

A  set  of  some  m-tuples  >,  a,-  e Ek 

is  called  an  m-ary  relation  over  Ek ;  denoted  by 
Gm.  a1  =(a{,---,aln),-",am  =  (aj" ,■••,«”)  are 
said  to  preserve  the  m-ary  relation  Gm  if 

<  aj >eGm,  i  =  l, •••,«  ;  a  function 
f(X]  ,•  •  • ,  xn  )  is  said  to  preserve  the  m-ary  relation 


Gm  if  for  arbitrary  a  ,•••,«  preserving  Gm 
either  <  f(a^),"-,f(am)>e.Gm  or  one  of 

f(a  *)>"'■>  fi0”1)  is  undefined.  The  set  of  all 
functions  preserving  Gm  is  denoted  by  T(Gm ) . 
Hereafter  we  always  assume  2  <  m  <  k . 

For  the  sake  of  unity,  we  look  on  TE  as  a  set  of 
functions  preserving  m-ary  relation  Gm.  Suppose 
without  loss  of  generality  that 

E-  {0,1,**-,j-1},  1  < s<£-l,  define  m-ary 
relation  as  follows: 

G„  =  {<«,«!, ■ >|«  =  0,1, •••,«-!, fly  €EkJ=U  ",m-l} 

Obviously,  \Gm\  =  s  X  km~'  <{k-  \)km~x  <  km 
and  there  exist  <aj,-*-,am  >eGm  such  that 
af  *  dj,i  *  j,i,j  =  therefore  Gm  is  a 

non-total  relation,  i.e.,  T{Gm )  <  Pk  .  We  shall  prove 
TE  =  T(Gm )  as  follows: 

Take  f(x\,"-,xn)  e  TE ,  thereby  for 
fll.-.a,  e£,  f{a\,‘“,an)  =  a,aeE  or 

=  *.  For  any 

a1  =  (a{, --^aj,  ),•••,  am  =  (a™  which 

preserves  Gm ,  since  <  aj ,-••,<*”  >eGm,, 
aj  eE,i  =  1  ,•••,« 

Thus /(a1)  =  /(«}, •••,««)  eisu  {*}  and 
f(ai)  =  f(a\,-,ain)eEku{*},i  =  2,-,m. 

It  follows  from  definition  of  the  Gm  that 
<f(a]),---,f(am)>eGm  or  one  of 

/(a’),---,/(am)  is  undefined.  Therefore 

f  eGm.  Consequently,  TE  c  T(Gm ) .  Since  that 

TE  is  the  precomplete  set  and  T(Gm)  <  Pk  , 
TE  =  T(Gm). 

Relation  Gm  and  G'm  is  said  to  be  similar  if 

and  there  exist  a  one-one 
mapping  a :  Ek  — >  Ek  such  that 

G'm  =  {<°(a\)>-Mam)>\<a\,->am  >^Gm} 
Clearly,  similar  relation  defined  above  is  equivalent 
relation.  We  denote  by  Gm=a  G'm  that  Gm  is 
similar  to  G'm  under  one-one  mapping  o  . 
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Ill.  Main  Results  and  Proof 


THEOREM  1.  Let  T(Gm)  be  a  precomplete  set 

except  Pk  ^  {*}  and  Gm  G'm .  Then  T{G'm ) 
is  also  a  precomplete  set. 

Proof.  From  [2],  all  precomplete  sets  except 

u  {*}  in  P/c  are  classified  to  6  classes,  they  are 
as  follows: 

Te  »  ^ S ,  ^R,  m »  Aj4t2  ^C4,2  ^  ^ P ' 

1 .  Let  T(Gm  )  =  Te  .  Suppose  without  loss  of 
generality  that  E  —  {0,1,  •••,5- 1},  1  ^  s  <  k- 1 , 
then 

Gm  =  {<«,«,, -,<!„_!  >|/  =  0,1,- -,s-\,aj  eEl,j=\,—,nt-\} 
It  follows  from  the  definition  of  similar  relation 
that 

G'm  =  {<  c(0»  <r(al  )>•  •  ■  )  >  I  <  t,0!  •  ’>am-\  >eGm} 

Let  E'  =  {o(0),---, o(s  - 1)} ,  it  is  easy  to 
prove  that  T{G'm)  —T g> .  Therefore  T{G'm)  is  a 
precomplete  set. 

2.  Let  T(Gm )  =  m .  Then 

Gm  =  U  Gm  ( {/,  j})  u  G*m ,  where  G*  is  either 
ij=l 
i*j 

empty  ( /n  *  2 )  or  only  contains  such  m-ary 
sequences  <  <*\ >  that 
at  *  aj ,  i  *  y,  t,y  =  1,-  •  •  ,m ,  and  Gw  is 
symmetric  to  Sm .  We  have  by  the  definition  of 
similar  relation  that 

G'm  =  {<cr(al),-”,cr(am)>\<al,‘--,am  >e Gm} 

Since  o  is  one-one ,  we  have 

m  * ' 

G'm  =  U  G'm{{Uj))vGm 
«.y®i 

i*j 

where 

G'm({i,j})  =  <au-,am  >6^,^/)  =  ^)} 

,i  *  y ,  1  <  i,  y  <  m, 

Gm  =  {<o-(a,),—,cr(flOT)>  <a1,---,am  >eGw} 
We  prove  G^  is  symmetric  to  Sm  as  follows: 

Suppose  7  ^ Em,  <■  bT^,”' 9bT^m^  >sGm  ,i.e., 
<bx,---,bm  >e  G'm .  By  definition  we  have  that 

<  <T  (^j )»*"*» (^m)  •  Let 

Cj  =  cr_1(fy),  /  =  1, •••,/»,  then 


<  Cj  ,•••  ,cm  Gm .  Since  Gm  —  Gm , 

<  cr(i)»-  •  •  .cr(m)  >6  Gm .  Therefore 

<  o (cr( [)),-••, a (cr(m))  >eG'm.  Obviously, 

cr(l)  =  <t_i  (6r(0 ),  /  =  1, •  •  •  ,ro.  So 

C(<V(<))  =  °‘(°'_,(6r(0))  =  ^r(i)  *  *  = 

Thus, 

Consequently,  G'mT  cG^, 

On  the  other  hand,  suppose  <b\,--->bm  >eG'm, 

then  <  cr_l  (6, ), •  •  • , cr_1  (bm)>eGm.  Since 


Gi=Gm,  GTm  =Gm.  Thereby 


~  *  m 

.  — “1/L  \ 


=><V|(]),"’V1(« >sG” 


=><bx,-”,bm  >e(?;r 


c;cg; 


It  means  G„T  =G'm,  i.e.,  G^  is  symmetric  to 
Sm. 

m 

3)  For  the  case  that  T(Gm)  is  one  of 
Si,m,SR^m,LG42(VGA2)  and  ,  the  proof  is 

similar  to  the  proof  given  above. 


THEOREM  2.  Suppose  Gm=a  G'm  and 
/(*i  >•">•*«)  eT(Gm).  Let 

=  <7(yX<T~ 1  (xj ),•••, <r_1  (xn ))),  rr(*) 
Then  g{xx,---,xn)  eT{G'm) . 

Proof.  Suppose  that  a  preserve  G'm , 

where  a1  =  (a\,---,a‘n),i  =  1,' — ,m.  Thus 
<a},---,aj  >eG'm,j  =  1, •••,«.  Fcom definition 
we  have  that 

g(a{ =  <7(/(<7_1  (aj ), •  •  •  ,<r-1  (a^ ))),  J  =  1, •  •  •  ,m 

and<  cr_1(flj),-",cr'"1(ay  )  >  Gm,y  =  1, 

Thereby. 

</(o-_l(o').— ,o’-1(oi)).-"»/(°'”1(ar)"">£r"l(ar))>eGiii 

So 

=<£7(/(<7-,(a,'),-.<T-1(ai))).-,(T(/(o--|(ar).-^',(a™)))>eGm 

Therefore  g(^i  > ^ . xn  )  e  T(> G'm ) . 
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THEOREM  3.  Suppose  that  Gm=a  G'm  and 

S 

T(Gm)  c  \JT(G'f!  )  (  Union  of  some  precomplete 
«=l 

sets),  2  <  hj  <  k,  and  when 
hi  =m,  T(Gm)  *  T(G£).  Then 

2XGi)c  U  JXOP.  where  G$  <% , 

and  when  ht  -m,  T{G'm)  *  T(G”r). 

Proof.  Suppose  •••,:*:„)  «=  T{G'm).  Let. 

/Ol  _1  (g(o-(x,  ),•••,  a(xn ))),  cr(*)  =  * 

Then  following  after  the  proof  of  theorem  2,  we  can 

prove  that  f(x  i, •••,*„)  eT(Gm)  c  (J  T(G%). 

Without  loss  of  generality  suppose  f  e  ) . 

Let  Gfy  =a  G^" ,  from  the  definition  of  / ,  we 
have 

g(x l .->*„)  =  (*i ), ■ ■ •  ( xn ))),  cr(*)  =  * 

Thus  it  follows  from  theorem  1  and  theorem  2  that 
T(Gf,'.')  is  a  precomplete  set  and 

g(xu-,xn)eT(G%). 

When  =  m ,  we  know  that  T(Gm)  *  r(G"). 

if  r(G;)  =  j(G"'). 

Suppose  f(x{ ,---,xn)  e r(Gw ) .  Let 

g(*l ,  ‘,x„)  =  a(f(cr~] (xy ),- •  ,(cr_1  (x„ ))),  <r(*)  =  * 
Then  it  follows  from  theorem  2  that 
g(xl,-,xn)eT(G^)c:nG'rZ),i.e., 
g(x\,  ">xn)  eT(G£). 

But, 

f(x l  ,•  •  • , )  =  cr_1  (gCtrC^!  ),•••, cr(xn  ))),  cr(*)  =  * 
since  G"  G"' ,  following  after  the  proof  of 

theorem  2  we  have  /(jcj  ,•  •  • , x„  )  €  T(G" ) ,  so 
T(Gm)  c  r(G") .  It  can  be  proved  in  the  same 
way  that  T(G £)  c  T(Gm) . 

Therefore  T(Gm )  =  T(G^)y  which  is  a 
contradiction.  Hence  T(G'm )  *  T(G^f) . 

Following  after  the  proof  of  theorem  2  and 
theorem  3,  we  have 

THEOREM  4.  Let  Gm=aG'm  and  there  exist 
f(x\,"',Xi)  eT(Gm)  such  that  for  arbitrary 
T(In)  T(Gm)  there  have  /  €  T(In) .  Then 
there  exist  •  ,*/)  6  T(G'm )  such  that  for 


arbitrary  T{I'n)  *  T{G'm)  there  have  g  £  T(I'n ) , 
where  In ,  I'n  are  the  set  of  n-ary  relations, 

2  <n<k . 

The  four  theorems  given  above  show  that  two 
sets  of  functions  preserving  two  similar  relations 
respectively  are  either  all  the  component  parts  of  the 

minimal  covering  of  precomplete  sets  of  Pk  or  all 
not. 
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Abstract 

Logic  offers  the  possibility  of  modeling  and  reason¬ 
ing  about  hardware  and  software .  But  which  logic?  We 
propose  monadic  logics  of  strings  and  trees  as  good  can¬ 
didates  for  many  kinds  of  discrete  systems.  These  log¬ 
ics  are  natural ,  decidable ,  yet  substantially  more  ex¬ 
pressive,  extensions  of  Boolean  logic.  We  motivate 
their  applicability  through  examples. 


1  Introduction 

Transistors,  pins,  and  power  supplies  are  the  stuff 
from  which  computers  are  made.  However,  this  view  is 
not  very  helpful  for  understanding  what  computers  ac¬ 
tually  do.  For  this  we  require  a  language  for  modeling 
the  structure  and  behavior  of  systems  and  for  relating 
these  models,  i.e.,  for  showing  that  the  system  modeled 
has  the  intended  behavior.  This  requirement  is  fulfilled 
by  logic. 

Boolean  logic  is  the  traditional  logic  of  switching 
theory.  It  is  often  applied  to  model  and  reason  about 
discrete  systems.  From  the  engineering  perspective,  it 
is  a  logic  for  expressing  operations  on  bits.  For  in¬ 
stance,  a  combinational  circuit  may  be  represented  in 
Boolean  logic  by  a  formula  whose  syntax  reflects  its 
structure,  e.g.,  an  and- gate  in  the  circuit  corresponds 
to  a  logical  and-operation.  Moreover,  one  can  model 
the  circuit’s  behavior  in  the  logic  and  verify  that  the 
two  models  axe  equivalent.  When  the  models  are  accu¬ 
rate,  the  result  predicts  the  behavior  of  actual  circuits. 

The  primacy  of  Boolean  logic  for  such  tasks  is  un¬ 
derstandable.  It  is  conceptually  simple  and  adequate 
for  modeling  the  structure  and  behavior  of  many  kinds 
of  discrete  systems.  Moreover,  verification  is  decid¬ 
able  and,  using  data-structures  like  BDDs,  many  prob¬ 
lems  have  acceptable  running  times.  However  there 
are  drawbacks.  Certain  kinds  of  systems,  even  when 
discrete  (or  viewed  as  such)  cannot  be  modeled  in 
Boolean  logic.  Moreover,  even  when  they  can,  the 


Boolean  model  may  differ  substantially  from  an  engi¬ 
neer’s  model,  and  this  makes  modeling  error  prone.  A 
solution  is  to  use  more  expressive  logics  like  first-order 
logic,  trading  increased  expressiveness  for  the  loss  of 
decidability.  Here  we  propose  a  middle  way,  namely, 
decidable  monadic  second-order  logics  of  strings  and 
trees  [5,  8,  14,  17]. 

We  motivate  this  proposal  by  examining  the  poten¬ 
tial  and  limitations  of  structural  and  behavioral  mod¬ 
eling  in  a  progression  of  logics,  from  Boolean  logic  to 
extensions  with  quantifiers  and  variables  ranging  over 
strings  and  trees  of  Booleans.  The  string  and  tree  log¬ 
ics  may  seem  exotic,  but  they  are  natural  and  signifi¬ 
cant  extensions  of  Boolean  logic  and  have  a  much  wider 
range  of  applications.  Moreover,  like  Boolean  logic, 
they  are  decidable  and  hence  offer  the  possibility  of 
automated  verification.  Thus,  although  monadic  log¬ 
ics  may  not  challenge  the  primacy  of  Boolean  logic,  we 
believe  that  they  are  well-suited  for  reasoning  about 
discrete  systems  and  deserve  a  place  in  the  repertoire 
of  engineers  working  with  such  systems. 

Although  the  use  of  the  monadic  logic  of  strings  in 
modeling  parameterized  combinational  and  sequential 
systems  has  already  been  established  (see  [3,  4]),  the 
exposition  here  is  from  a  different  perspective,  centered 
around  exploring  tradeoffs  and  providing  support  for 
the  above  proposal.  In  addition,  the  use  of  a  monadic 
logic  of  trees  for  hardware  modeling  and  verification  is 
new  and  presents  a  new  approach  to  reasoning  about 
hierarchical,  tree-structured  systems. 

2  Boolean  Logic 

We  begin  with  Boolean  logic  and  its  application 
to  structural  and  behavioral  modeling.  Although  this 
logic  is  adequate  for  many  problems,  it  has  limitations, 
which  motivate  generalizations. 

The  formulae  of  Boolean  logic  are  built  from  vari¬ 
ables  from  a  set  V  =  {x\,  a?2, . . . }  and  functions  (also 
called  connectives)  like  V,  A,  f*,  and  Formulae 
are  interpreted  in  the  set  ®  =  {0, 1}  of  truth  values. 
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An  evaluation  a :  V  -4  B  is  a  mapping  from  variables 
to  truth  values  that  is  homomorphically  extended  to 
formulae.  When  <r{(p)  =  1,  for  some  cr,  we  say  that  ip  is 
true  under  a  and  that  <p  is  satisfiable.  Since  the  satis¬ 
fiability  of  ip(x i , . . .  ,  Xn)1  depends  only  on  the  assign¬ 
ments  for  the  variables  yxny  we  can  represent 

an  assignment  as  a  tuple  b  =  (61, . . .  ,fen)>  for  bi  E  B, 
which  we  call  a  Boolean  model  A  formula  is  a  tautology 
( contradiction )  if  it  is  true  in  all  (no)  models. 

For  example,  the  formula  Xi  0  x<i  defined  by  ( x\  A 
->£2)  V  (->xi  A  X2)  semantically  defines  a  function  that 
is  true  in  the  models  (1,0)  and  (0,1).  Alternatively, 
since  a  function  on  Bn  -4  B  can  also  be  viewed  as  a 
relation  on  Bn,  0  defines  the  relation  {(1,0),  (0,1)}. 
As  the  standard  Boolean  connectives  are  functionally 
complete ,  every  relation  over  Bn  can  be  defined  by  a 
formula  in  the  logic. 

2.1  Example:  Gate  Level  Modeling 

Boolean  logic  can  be  used  to  model  discrete  systems. 
This  is  best  seen  with  the  example  of  combinational 
circuits.  Structurally,  we  can  model  a  circuit  C  with 
inputs  ,in  and  outputs  01,...  ,om  by  defining 

m  functions  on  Bn  -4  B  that  model  the  computations 
required  to  compute  each  output.  By  this  we  mean 
that  the  structure  of  the  circuit  is  reflected  in  the  func¬ 
tions  used  to  compute  it,  e.g.,  that  each  and-gate  in  the 
circuit  corresponds  to  an  A-connective  in  the  function 
definition. 

Note  that  when  we  talk  about  modeling  systems, 
our  use  of  the  term  model  is  different  from  the  previ¬ 
ous  semantic  use  (Boolean  model).  In  the  terminol¬ 
ogy  of  formal  methods  practitioners,  a  model  is  a  con¬ 
struct  or  mathematical  object  that  describes  a  system 
or  presents  a  theory  that  accounts  for  properties  of  a 
system.  Of  course,  these  two  uses  are  connected:  a 
structural  or  behavioral  model  has  a  semantics  given 
by  the  Boolean  model.  The  role  of  logic  then  is  to  con¬ 
nect  these  two.  By  proving  that  different  structural 
and  behavioral  models  are  equivalent,  we  show  that 
they  have  the  same  Boolean  models. 

Consider  a  simple  example.  Figure  1  describes  a 
circuit  for  a  1-bit  full-adder.  The  two  outputs,  sum 
and  cout,  are  functions  of  the  inputs  a,  6,  and  cm. 
We  can  model  the  structure  of  this  circuit  in  Boolean 
logic  by  formulae  that  reflect  how  sum  and  cout  are 
computed. 

sum(ay  6,  cin)  =  (a  ©  6)  0  cin 

cout(ay  by  cin)  =  ( cin  A  (a  0  b))  V  (a  A  6) 

1i.e.,  (p  has  xi , . . .  ,  xn  as  variables.  For  logics  with  quantifiers, 
we  mean  the  free  variables  of  <p. 


Figure  1. 1-bit  full  adder 

In  this  structural  model,  each  gate  corresponds  to  a 
Boolean  function  and  circuit  composition  is  modeled 
by  function  composition. 

The  semantics  of  these  formulae  are  a  model  of  the 
behavior  of  the  circuit.  However,  it  is  often  useful  to 
formalize  a  behavioral  model  within  Boolean  logic  so 
that  we  can  establish  theorems  about  the  equivalence 
of  different  structural  and  behavioral  models.  For  ex¬ 
ample,  we  can  model  the  behavior  of  sum  (a,  6,  cin)  by 
a  formula  sumJbeh(aybydn)  that  enumerates  the  in¬ 
puts  for  which  the  output  bit  sum  is  set  ( cout.beh  is 
defined  analogously),  i.e., 

{a  A  A  -1  cin)  V  (~ia  A  6  A  —>cin)  V 

(-■  a  A  -A  A  cin)  V  (a  A  b  A  cin) . 

Although  sumJbeh  is  logically  equivalent  to  sum ,  it 
is  perhaps  more  perspicuous  as  a  behavioral  description 
since  it  amounts  to  a  tabular  description  of  the  function 
computed.  Note  that  the  functional  completeness  of 
Boolean  logic  means  that  every  Boolean  function  can 
be  described  by  a  formula;  so  Boolean  logic  is  adequate 
for  this  kind  of  behavioral  modeling. 

Given  a  formalization  of  both  the  structure  and  the 
behavior  of  a  circuit  in  Boolean  logic,  we  can  use  a 
decision  procedure  to  validate  that  the  two  formaliza¬ 
tions  are  logically  equivalent,  and  thus  define  identical 
functions  semantically.  In  this  case  we  verify  that 

(sum(ay  by  cin)  44  sumJbeh{ay  6,  cin))  A 
(cout(ay  by  cin)  44  coutJbeh(ay  6,  cin)) . 

2.2  Limitations 

The  above  example  suggests  that,  for  certain  kinds 
of  discrete  systems,  namely  combinational  circuits, 
Boolean  logic  is  adequate  to  model  behavior  and  struc¬ 
ture  and  to  demonstrate  their  equivalence.  However, 
even  for  such  simple  systems,  there  are  inadequacies. 
In  particular,  the  connection  between  the  model  (the 
formula)  and  what  is  modeled  (the  circuit)  may  be  less 
direct  than  we  would  like.  For  example,  although  each 
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gate  in  the  circuit  corresponds  to  a  function  in  the  for¬ 
mula,  shared  components  must  be  duplicated  in  the 
model.  So  the  corresponding  formula  can  be  signifi¬ 
cantly  larger  and  the  relationship  between  subcircuits 
and  subformulae  obscured.  Moreover,  circuits  must  be 
modeled  as  functions  when  in  practice  some  are  better 
modeled  as  relations,  e.g.,  bi-directional  circuits. 

3  Quantified  Boolean  Logic 

A  simple  extension  to  Boolean  logic  is  to  add  the 
Boolean  quantifiers  V  and  3;  the  result  is  called  quanti¬ 
fied  Boolean  logic  (or  QBL).  In  the  semantics,  quanti¬ 
fiers  are  interpreted  as  ranging  over  B.  The  remaining 
details  are  identical  to  Boolean  logic. 

It  follows  semantically  that  we  can  eliminate 
quantifiers:  in  any  model  (&i,...,i>n)  G  Bn, 

3xn_}_i.  (p(xi , . . .  ,  xn,  xn+\)  is  semantically  equivalent 
to(p(xi , ...  ,  xn,0)V<p(xi,...  ,xn,l).  Replacing  V  with 
A  yields  a  similar  equivalence  for  V.  We  can  eliminate 
quantifiers  syntactically  too:  if  we  replace  0  (respec¬ 
tively  1)  in  the  above  expansion  by  any  suitable  contra¬ 
diction  (respectively  tautology),  then  we  can  use  the  re¬ 
sulting  equivalence  to  eliminate  the  Boolean  quantifiers 
in  any  QBL  formula.  The  resulting  formula  though 
may  be  exponentially  larger.  This  difficulty  is  reflected 
in  complexity  of  QBL:  it  is  PSPACE-complete.  So, 
provided  that  P  ^  NP ,  theorem  proving  for  QBL  is 
harder  than  for  Boolean  logic. 

Although  quantifiers  do  not  give  QBL  any  more  ex¬ 
pressive  power  than  Boolean  logic,  there  is  a  sense  in 
which  QBL  is  more  powerful:  some  Boolean  functions 
can  be  expressed  more  succinctly  and,  moreover,  QBL 
formulae  can  better  capture  the  structure  of  the  system 
modeled.  The  next  example  illustrates  this. 

3.1  Example:  Gate  Level  Modeling 

In  QBL,  we  can  formalize  circuits  as  relations  over 
their  external  ports.  Circuits  are  built  from  relations 
representing  primitives,  such  as  transistors  or  gates, 
and  are  combined  with  conjunction  and  ‘wired  to¬ 
gether’  using  existential  quantification.  This  style  of 
representation  is  standard  in  the  theorem  proving  com¬ 
munity  and  scales  well  to  complex  systems  [6]. 

For  example,  to  model  the  adder  given  in  Figure  1 
we  begin  by  defining  the  following  gates. 

and(a,  6,o)  =  of>(aA6) 

or(a,6, 6)  =  o  (a  V  6) 

xor(a,  6,o)  =  o  <4  ((a  A  ~^b)  V  (-»a  A  b)) 

We  then  compose  these  to  model  the  adder.  The  top 
half  of  the  adder  consists  of  two  x  or- gates,  connected 


by  an  internal  wire  w\y  which  compute  the  sum  bit  out. 
The  bottom  half  uses  the  internal  wire  w\ ,  as  well  as 
the  two  inputs  a  and  6,  to  compute  the  carry-out  bit 
cout.  Our  definition  conjoins  the  gate  descriptions  and 
projects  away  the  internal  wires: 

fu!Ladder(a,  6,  out,  tin,  cout)  =  3wi1u>2,W3- 

xor(a,&,wi)  Axor(u;i,  cin,  out)  A 

and(a,6,  W2)  A  and(cm,u;i,it;3)  A  or(w3,W21  cout) 

This  formula  is  a  better  structural  model  of  the 
circuit  pictured  in  Figure  1  than  the  previous  one  in 
Boolean  logic.  Of  course,  it  is  still  only  a  model  and 
it  abstracts  away  aspects  of  actual  circuits  such  as  lay¬ 
out.  This  is  acceptable  when  we  want  to  reason  about 
functional  behavior  at  the  level  of  bits,  but  unaccept¬ 
able  for  other  properties,  e.g.,  delay.  Modeling  other 
aspects  requires  either  formalizing  them  in  QBL  or  a 
richer  logic. 

3.2  Limitations 

Although  QBL  offers  advantages  over  Boolean  logic, 
many  kinds  of  discrete  systems  cannot  be  modeled  in 
it.  In  our  example,  we  modeled  the  static  behavior  of 
a  system,  i.e.,  the  relationship  between  its  inputs  and 
outputs,  which  constitute  the  system  state.  However, 
it  is  difficult  to  model  dynamic  behavior ,  that  is,  how 
the  state  changes  over  time. 

We  can  only  approximate  this.  Namely,  rather 
than  formalizing  a  predicate  (p  of  the  state  variables 
x,  we  instead  formalize  a  Boolean  ‘before-after’  predi¬ 
cate  </?'(x,  x')  that  contains  a  ‘primed’  version  x'  of  each 
state  variable  x  and  formalizes  how  the  state  changes 
over  one  unit  of  time.  In  this  way  we  can  formalize 
dynamic  behavior  over  any  fixed  number  of  time  inter¬ 
vals;  however  formalizing  behavior  over  arbitrary  time 
intervals  is  impossible. 

A  second  limitation  is  that  these  logics  cannot  be 
used  to  model  families  of  related  systems.  For  exam¬ 
ple,  our  1-bit  adder  specifications  can  be  generalized 
to  adders  over  2,  3,  ...  bit  words.  Hardware  descrip¬ 
tion  languages  such  as  VHDL  [1]  typically  provide  some 
kind  of  parameterization  mechanism  which,  combined 
with  iteration,  allows  parametric  descriptions.  Again, 
this  is  impossible  using  the  weak  logics  that  we  have  so 
far  considered. 

It  turns  out  that  a  generalization  of  the  approxima¬ 
tion  idea  described  above  can  be  used,  in  many  cases, 
to  solve  both  problems.  Rather  than  writing  a  formula 
< p(x )  with  models  b  G  Bn,  we  instead  let  x  range  over 
sequences  of  Boolean  models  6(0)6(1) . . .  b(m)  of  arbi¬ 
trary,  but  finite,  length.  That  is,  a  model  is  a  string 
that  states  how  the  values  (Boolean  model)  of  the  n 
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state  variables  evolve  over  m  time  intervals.  Alterna¬ 
tively,  the  model  can  represent  the  behavior  of  a  system 
with  n  state  variables,  each  ranging  over  values  of  an 
arbitrary  ‘length’ m.  This  generalization  requires  gen¬ 
eralizing  QBL,  which  is  a  logic  of  bits,  to  a  logic  of  bit 
sequences,  or  strings. 

4  Monadic  Logic  of  Strings 


We  present  here  a  monadic  logic  of  strings  called 
WS1S.  Work  on  WS1S  and  related  monadic  theories 
goes  back  to  Biichi  [5],  Elgot  [8},  and  Trakhtenbrot  [18]. 
We  briefly  review  syntax  and  semantics. 

Let  x  and  X  range  over  disjoint  sets  Vi  and  V2 
of  first  and  second-order  variables.  The  language  of 
WS1S  is  described  by  the  following  grammar. 

t  ::=  x  |  0  |  s (t) 

p  X(£)  |  pAp  \  ->p  \  3x.p  \  3 X.p 

Hence  terms  are  built  from  first-order  variables,  the 
constant  0,  and  the  successor  symbol.  Formulae  are 
built  from  atoms  X(£)  and  are  closed  under  conjunc¬ 
tion,  negation,  and  quantification  over  first  and  second- 
order  variables.  Other  connectives  and  quantifiers  can 
be  defined  using  standard  classical  equivalences,  e.g., 
Vx.  p  =  “«3x.  —*p. 

Semantically,  WS1S  formulae  are  interpreted  in  N. 
0  and  s  are  zero  and  the  successor  function  and  X(t) 
is  true  when  the  number  denoted  by  t  is  in  the  set  of 
numbers  denoted  by  A.  First-order  quantification  is 
quantification  over  natural  numbers,  whereas  second- 
order  quantification  is  quantification  over  finite  sets  of 
natural  numbers. 

WS1S  stands  for  the  weak  second-order  monadic 
theory  of  one  successor.  It  is  monadic  as  all  atoms 
are  unary  predicates.  It  is  weak  because  second-order 
variables  are  interpreted  over  finite  (instead  of  arbi¬ 
trary  subsets)  of  the  domain.  And  it  has  only  a  single 
successor  function. 

Although  WS1S  is  a  logic  of  numbers  and  sets, 
we  motivated  it  as  a  logic  of  strings,  as  this  empha¬ 
sizes  that  WS1S  is  a  natural  extension  of  (quantified) 
Boolean  logic.2  The  relationship  is  as  follows.  Any  fi¬ 
nite  string  b(0)b(l) . . .  b(m)  over  B  encodes  a  finite  set 
of  positions,  namely  {p  €  {0, . . .  ,  m}  |  b(p)  =  1}.  More 
generally,  we  can  encode  n  strings  over  B  as  a  single 
string  over  Bn.  Hence,  if  p{X)  is  a  formula  whose  free 
second-order  variables  are  X  =  ,  Xn,  a  WS1S 

2 This  view  is  also  taken  by  Thomas  in  [16]  where  Boolean 
logic  is  used  to  motivate  a  closely  related  monadic  logic  of  strings, 
which  he  calls  Buchi-Elgot-Trakhtenbrot  logic. 


model  can  be  encoded  by  a  finite  string  over  the  alpha¬ 
bet  Bn . 

As  a  simple  example,  the  formula  p(X ,  Y )  given  by 
X (0)  A  Vp.  (X(p)oY(s(p))) 


states  that  the  string  X  begins  with  a  1  and  that  Y 
equals  X  shifted  one  position Jto  thejright.  A  model 
for  this  formula  is  a  string  b(0)b(l) . . .  b(m)  where  each 
b(i)  is  a  letter  in  B2 .  To  visualize  this,  we  write  letters 
(61,62)  vertically;  the  first  row  encoded  in  the  string 
determines  an  interpretation  for  X,  and  the  second  for 
Y .  Two  such  models  are 


1 . 

0 

1 

0 

1 

1 

0 

1 

and 


0 

1 

0 

0 

0 

0 

The  first  model,  for  example,  interprets  X  as  {0, 2}  and 
Y  as  {0, 1,3}.  (p  is  true  in  the  first  model  and  false  in 
the  second. 

Since  models  are  strings,  the  set  of  models  that  sat¬ 
isfy  a  formula  constitutes  a  language.  A  fundamental 
theorem  for  WS IS  is  that  this  logic  precisely  captures 
the  regular  languages:  the  language  associated  with 
each  WS1S  formula  p  is  regular,  and  vice  versa.  This 
is  the  key  to  decidability.  Using  automata  theoretic 
techniques,  given  a  formula  ip,  we  can  construct  an  au¬ 
tomaton  that  recognizes  the  models  that  satisfy  p.  A 
formula  p(X)  is  a  tautology  when  the  corresponding 
automaton  accepts  the  universal  language  on  Bn .  The 
‘vice  versa’  part  of  this  theorem  gives  us  an  analog  of 
functional  completeness  for  WS1S:  we  can  model  the 
behavior  of  any  system  whose  behavior  is  regular  in  the 
language  theoretic  sense. 

WS1S  is  substantially  more  expressive  than  QBL, 
but  this  comes  at  a  high  price.  WS1S  is  nonelemen¬ 
tary  decidable  [12]:  a  formula  of  size  n  may  require 
time  and  space  bounded  below  by  an  iterated  stack  of 
exponentials  whose  height  is  proportional  to  n. 


4.1  Tools  for  WS1S 


Despite  the  high  complexity  of  WS1S,  several 
groups  have  implemented  decision  procedures  for  it 
[10,  11,  13]  that  work  surprisingly  well  on  many  non¬ 
trivial  problems.  All  examples  in  this  paper  have  been 
run  in  the  MONA  system  [10],  which  implements  a  de¬ 
cision  procedure  for  WS1S  and  its  generalization  to 
trees  considered  in  the  following  section. 

MONA  offers  various  kinds  of  syntactic  sugar  that 
are  useful  in  large  specifications,  which  we  employ  in 
our  examples.  For  example,  first-order  variables  are  ac¬ 
tually  defined  in  terms  of  second-order  variables  (where 
a  first-order  variable  is  interpreted  by  a  string  with  a 
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Figure  2.  An  n-bit  adder  instance,  for  n  -  3 


1  in  only  a  single  position,  giving  the  value  of  the  first- 
order  term).  Hence  formulae  may  have  free  variables 
of  either  order.  We  also  employ  sugared  notation  for 
QBL,  which  is  definable  in  WS1S  by  reasoning  about 
the  Oth  position  of  second-order  variables.  For  exam¬ 
ple,  the  QBL  formula  Vp.  3q.  p  f*  q  can  be  defined 
by  the  WS1S  formula  VP.3Q.P(0)  O  Q{ 0).  MONA 
supports  the  use  of  Boolean  variables  and  Boolean 
quantifiers.  To  distinguish  between  Boolean,  first, 
and  second-order  variables  we  use  the  following  nam¬ 
ing  convention:  Second-order  variables  are  capitalized, 
first-order  variables  are  lower-case  letters  with  names 
like  i,  p,  and  q ,  and  Booleans  are  lower-case  strings  like 
a,  6,  cin  and  ccmt.  Moreover,  we  superscript  quantifiers 
with  either  a  0,  1,  or  2  to  indicate  Boolean,  first,  and 
second-order  quantification  respectively.3  Finally  from 
the  minimal  syntax  of  WS1S  we  have  given,  one  can 
define  additional  operators  with  their  expected  mean¬ 
ing,  e.g.,  comparison  operators  on  terms  like  =  and  <. 
We  use  such  operators  freely. 

4.2  Example:  Parameterized  Ripple-Carry 

Adder 

Here  we  give  a  simple  example  of  how  WS1S  can 
be  used  to  structurally  and  behaviorally  model  a  pa¬ 
rameterized  family  of  circuits.  For  more  sophisticated 
examples,  as  well  as  examples  of  modeling  dynamic 
(sequential)  systems,  see  [4]. 

Let  us  start  with  a  structural  model.  We  have  previ¬ 
ously  shown  how  to  model  a  1-bit  adder.  Figure  2  sug¬ 
gests  how  an  adder  can  be  iterated,  in  a  ‘ripple-carry’ 
way,  to  construct  an  n-bit  adder,  for  n  =  3.  In  general, 
we  can  model  an  n-bit  adder  by  wiring  together  n  1- 
bit  adders  where  the  carry-out  of  the  ith  adder  is  the 
carry-in  of  the  i-flst.  The  first  carry  has  the  value  of 
the  carry-in  and  the  last  has  the  value  of  the  carry-out. 

It  is  easy  to  formalize  a  ripple-carry  architecture  by 
a  WS1S  formula.  If  we  use  C  to  represent  the  sequence 
of  carries,  we  can  formalize  the  general  case  by  the  fol¬ 
lowing  formula,  which  relates  three  strings  (the  inputs 
A  and  B  and  the  output  S)  and  two  Booleans  (the 

3  Any  remaining  ambiguity  (indeed,  even  without  any  of  these 
conventions)  can  be  resolved  from  context. 


carry-in  cin  and  carry-out  cout)]  the  number  of  bits 
added  is  given  by  the  parameter  n. 

adder(n,  A,  B>  5,  cm,  cout)  = 

3 2C.  (C( 0)  <->  cin)  A  ( C(n )  +*  cout)  A 
V1/?.  p  <  n  -> 

fulLadder(A(p),  B{p),  5(p),  C(p),  C(p  +  1)) 

This  is  a  direct  formalization  of  our  natural  language 
description  where  quantification  over  positions  p  for¬ 
malizes  iteration  (generalized  conjunction)  over  the  n 
1-bit  adders.  Note  that  given  the  encoding  of  Boolean 
logic  in  WS IS,  we  can  use  the  previously  given  struc¬ 
tural  model  of  a  1-bit  full  adder. 

Let  us  now  turn  our  attention  to  a  behavioral  model 
in  which  we  state  how  strings,  representing  n-bit  bi¬ 
nary  numbers,  are  added.  WS1S  is  a  logic  of  strings 
and  arithmetic  must  be  encoded  within  its  limited  lan¬ 
guage.  We  model  addition  by  formalizing  the  standard 
algorithm  for  adding  base-two  numbers:  The  ith  out¬ 
put  bit  is  set  if  the  sum  of  the  ith  inputs  and  carry-in 
is  1  mod  2,  and  the  ith  carry  bit  is  set  if  at  least  two 
of  the  previous  inputs  and  carry-in  are  set.  Since  we 
specify  the  behavior  of  an  n-bit  adder,  we  must  restrict 
the  addition  modulo  2n  and  compute  the  carry  values 
as  special  cases. 

at  Jeast_two(a,  b,c,d)  =  d  <->  (a  A  b)  V  (a  A  c)  V  (6  A  c) 
mod_two(a,  6,  c,  d)  =  aH&Hcorf 


adder_beh(n,  A,  £?,  5,  cm,  cout)  = 

32C.  (C( 0)  cin)  A  (C(n)  B-  cout) 

V*pp  <n-± 

atJeast_two(A(p),B(p),C(p),Cr(p  +  1)) 
modjtwo(A(p),  J3(p),  S(p),  C(p)) 

This  specifies  a  language  over  B6 ,  which  encodes  in¬ 
terpretations  for  n,  A,  By  5,  cin  and  cout.  For  example, 
one  string  in  this  language  is: 


0 

1 

2 

3 

4 

n 

0 

0 

0 

0 

1 

A 

1 

1 

0 

0 

0 

B 

1 

0 

0 

1 

0 

S 

1 

0 

1 

1 

0 

cin 

1 

0 

0 

0 

0 

cout 

0 

0 

0 

0 

0 

(i) 


Recall  that  in  the  interpreting  string  for  first-order  vari¬ 
ables  like  n,  exactly  the  nth  bit  is  1.  Moreover,  cin  and 
cout  are  true  iff  the  Oth  bit  of  their  interpreting  string 
is  1.  Hence  the  above  encodes  that  the  bit- width  n  =  4. 
Reading  Boolean  strings  as  binary  numbers  from  left 
to  right  (up  to  the  nth  position)  the  next  three  lines 
encode  A  =  3,  B  =  9,  and  5  =  13.  Since  the  carry-in 
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is  set  and  the  carry-out  isn’t,  this  string  does  is  indeed 
in  the  addition  relation  modeled. 

The  adder’s  structural  and  behavioral  models  are 
similar  in  that  they  both  iterate  components  that  add 
a  single  digit.  Using  similar  models  may  be  natural, 
but  it  runs  the  risk  of  making  similar  mistakes  in  both. 

An  alternative  behavioral  model  that  avoids  this 
problem  is  based  on  the  equation 

val(n,S)  +  2  nvl(cout)  — 

val(n,  A)  +  val(n,B)  +  vl(cin) ,  ' 

where  val(n ,  5)  computes  the  value  of  a  string  5  as  an 
n-bit  word,  and  vl  converts  a  Boolean  to  a  number. 
With  this  model  we  define  once  and  for  all  an  encoding 
of  natural  numbers  as  strings,  together  with  predicates 
encoding  (Presburger)  arithmetic  operations.  This  en¬ 
coding  of  arithmetic  can  be  reused  when  specifying  the 
behavior  of  other  circuits,  like  counters,  comparators 
and  the  like. 

val(n,  X ,  Y)  =  V»p.  Y{p)  (p  <  n  A  X(p)) 

powof2(n,  6,  X)  =  \Ap.  X (p)  *->  (p  =  n  A  b) 

vl (6,  N)  =  V*p.  AT(p)  o  (p  =  0  A  b) 

add  (A,B,5)  =  32<7.-<7(0)A 

V1]?.  atJeast_two(A(p),  B(p),  C(p),  C(p  +  1)) 
Amod.two(A(p),B(p),  S(p),  C(p)) 

We  now  directly  encode  (2)  as 

adder_beh(n,  A ,  B,  5,  cm,  ccm£)  = 

32A',  B',S',  CJAT,  CWT,  £/,  V,  W. 

vl(cm,C77V)  A  powof2(n,  cout,  COUT) 
Aval(n,A,  Af)  A  val(n,B,B') 

Aval(n,S,  S') 

A  add  (S',  COUT, U)  A  add(A',B',V) 

A  add  (V,  £7 JiV,  W)  A  U  ~W . 

4.3  Verification 

We  now  have  both  a  structural  model  and  two  equiv¬ 
alent  (and  this  equivalence  has  been  automatically 
checked  by  Mona),  but  differently  motivated,  behav¬ 
ioral  models  for  a  family  of  n-bit  adders.  We  can  for¬ 
malize  the  correctness  of  the  structural  model,  with 
respect  to  either  behavioral  model,  as  follows. 

\An.  V2  A,  B,  S .  V°cm,  cout. 
adder(n,  A ,  B,  5,  cin,  cout) 
adder_beh(n,  A,  B,  S',  cm,  cou£) 

This  is  verified  by  MONA  in  under  a  second  on  a  Sun 
Ultra-Sparc  workstation.  This  tells  us  that  any  adder 
in  this  family,  e.g.,  a  64-bit  adder,  will  operate  as  ex¬ 
pected,  provided  the  gates  it  is  built  from  function  as 
specified.  Such  a  parameterized  statement  cannot  be 
expressed  in  QBL. 


4.4  Limitations 

Any  structural  or  behavioral  model  formalized  in 
WS1S  defines  a  regular  language  over  a  fixed  alphabet 
Bn.  This  limits  what  can  be  modeled. 

To  begin  with,  observe  that  there  is  an  asymmetry: 
given  a  formula,  the  alphabet  size  and  thus  the  ‘width’ 
of  any  string,  is  fixed,  while  the  length  of  any  string  is 
finite,  but  a  priori  unbounded.  As  motivated  in  Sec¬ 
tion  3.2,  we  can  use  the  one  ‘unbounded  dimension’ 
to  model  the  dynamic  (sequential)  behavior  of  systems 
or,  alternatively,  families  of  related  systems,  e.g.,  pa¬ 
rameterized  by  their  bus- width.  However,  we  cannot 
overcome  this  asymmetry  and  model  systems  where 
both  the  width  and  length  dimension  are  unbounded, 
i.e.,  problems  that  can  be  modeled  on  two-dimensional 
grids  instead  of  one-dimensional  strings.4  As  a  result, 
many  practical  problems  concerning  discrete  systems 
cannot  be  modeled.  For  example,  we  cannot  reason 
about  families  of  sequential  systems  such  as  n-bit  reg¬ 
isters  or  counters. 

Furthermore,  we  can  only  model  systems  with  regu¬ 
lar  behavior.  A  parameterized  adder  defines  a  regular 
language;  the  intuition  for  this  is  that  an  automaton 
can  read  a  string  of  bits  like  that  in  (1)  and  it  need  only 
propagate  bounded  information  across  each  letter  (e.g., 
information  about  the  current  carry)  to  determine  if 
a  string  is  in  the  language.  However  not  all  parame¬ 
terized  systems  have  this  property.  For  example,  the 
language  associated  with  a  family  of  n-bit  multipliers 
is  not  regular. 

Despite  these  limitations,  WS1S  is  quite  expressive. 
Since  every  regular  language  can  be  represented  by 
a  WS1S  formula,  any  discrete  system  whose  behav¬ 
ior  is  regular,  or  that  can  be  structurally  modeled  as 
a  finite  state  transition  system,  can  be  formalized  in 
WS1S.  And  this  covers  many  systems  considered  in 
practice.  Furthermore,  in  some  cases,  we  can  model 
two-dimensional  systems  that  use  the  second  dimen¬ 
sion  in  a  limited  way.  For  example,  those  systems  that 
correspond  not  to  problems  on  grids,  but  to  problems 
on  trees.  Such  systems  play  an  important  role,  e.g., 
in  designing  efficient  circuits  using  divide  and  conquer 
techniques. 

5  Monadic  Logic  of  Trees 

In  Section  4  we  modeled  an  n-bit  ripple-carry  adder 
by  cascading  n  adders.  This  is  correct  although  ineffi¬ 
cient  since  a  delay,  linear  in  n,  is  required  to  propagate 

4Modeling  problems  on  grids  requires  a  stronger  logic,  e.g., 
with  dyadic  predicates.  This  leads  to  undecidability  as  one  can 
encode  problems  about  Turing  machine  computations  in  them. 
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the  carry  bits  between  stages.  An  alternative  design  is 
a  tree-structured  circuit  with  a  logarithmic  delay. 

We  cannot  model  tree-like  systems  using  a  logic  of 
strings.  Fortunately,  there  is  a  simple  generalization 
that  suffices.  Rather  than  having  a  single  successor  for 
the  next  position  in  the  string,  we  have  two  successors, 
for  ‘left’  and  ‘right’.  Then,  just  as  finite  sets  of  po¬ 
sitions  over  one  successor  are  encoded  by  strings,  sets 
of  positions  over  two  successors  are  encoded  by  binary 
trees.  Working  out  the  details  yields  the  weak  second- 
order  monadic  theory  of  two  successors  (or  WS2S) 
[9,  15,  17]. 

The  syntax  of  WS2S  is  similar  to  WS1S  except  we 
use  e  (instead  of  0)  to  denote  the  root  address  in  a 
tree  and  we  now  have  two  successor  .0  (left  child)  and 
.1  (right  child).  Semantically,  WS2S  formulae  are  in¬ 
terpreted  in  the  domain  V  =  {0, 1}*  and  e  denotes 
the  empty  string,  .0  denotes  concatenation  with  0,  and 
.1  denotes  concatenation  with  1.  All  other  details  are 
identical  to  WS1S. 

WS2S  generalizes  Boolean  logic  to  a  logic  of  trees 
because  second-order  variables  are  interpreted  by  fi¬ 
nite  subsets  M  CV  and  any  such  M  can  be  encoded 
by  a  Boolean-labeled,  binary  branching  tree.  The  de¬ 
cidability  of  this  logic  follows  by  generalizing  the  deci¬ 
sion  procedure  for  WS1S:  the  models  of  any  formula 
<p(x i , . . .  ,  xn)  define  a  regular  tree  language  over  the  al¬ 
phabet  Bn  and  given  (p  we  can  construct  a  tree  automa¬ 
ton  accepting  this  language.  As  with  WS1S,  WS2S  is 
non-elementary. 

The  following  is  a  simple  example  of  a  specification 
in  WS2S. 

X(e)  A  (VV.X(p.O)^X(p.l))  A  V*p.  -*Y{p.0)  V  ->Y(p.l) 

This  states  that  X  contains  the  root  position  e  and 
that  a  position  p  is  in  X  iff  its  brother  is  also  in  X. 
Moreover,  for  any  p,  Y  contains  at  most  one  of  p’s  suc¬ 
cessors.  The  interpretation  {5,0,1,00,01}  for  X  and 
{0, 01, 11}  for  Y  satisfies  this  formula  and  is  encoded  by 
the  following  tree  (where  the  first  component  of  each 
pair  encodes  the  interpretation  of  X  and  the  second 
encodes  Y). 


AqBq  S0  Ay  B}  Sy  A2#2  S2  A3  £3  S3 


Figure  3.  An  n-bit  Cla  instance,  for  n  =  4 


5.1  Example:  Parameterized  Carry-Lookahead 
Adder 

We  begin  by  describing  a  family  of  n-bit  carry - 
lookahead  adders  (or  Clas)  whose  overall  structure  is 
given  (for  n  =  4)  in  Figure  3.  We  do  not  describe  the 
adder  in  detail  (see  [7])  and  restrict  ourselves  to  a  few 
comments.  The  adder  operates  in  two  phases:  an  up¬ 
ward  phase  and  downward  phase.  In  the  upward  phase, 
a  carry  status  is  computed  for  each  internal  node.  This 
indicates  whether  an  incoming  carry  is  killed ,  propa¬ 
gated ,  or  a  new  carry  is  generated.  The  carry  status 
is  given  by  the  wires  E\  and  E2.  At  the  leafs  of  the 
tree,  the  carry  status  for  each  digit  is  computed  using 
the  circuit  depicted  in  Figure  4(a).  At  inner  nodes,  an 
operator  ®  is  used  to  combine  the  carry  statuses  of  the 
successor  nodes  (see  Figures  4(d)  and  4(b)). 

In  the  downward  phase,  an  inner  node  passes  the 
incoming  carry  bits  F±  and  F2  unchanged  as  signals  jF\° 
and  F§  to  its  left  successor;  the  carry  bits  F}>  F%  for 
the  right  successor  are  computed  using  the  operator  ® 
from  the  carry  statuses  and  the  incoming  carry  bits  (see 
Figure  4(d)).  As  indicated  in  Figure  4(c),  the  incoming 
carry  at  the  root  is  given  by  cin  and  the  outgoing  carry 
by  cout.  At  the  leaf  nodes,  the  incoming  carry  bits  Fi 
and  F2  are  used  to  compute  the  sum  bit  as  depicted  in 
Figure  4(a). 

5.2  Structural  Modeling  of  the  Cla 

Structural  modeling  in  WS2S  is  identical  to  WS1S 
except  now  we  use  two  successors  to  index  into  trees. 
Hence  we  translate  the  circuits  in  Figures  4(a)-4(d) 
directly  into  the  following  WS2S  formulae.  Note  that 
LeafCell  and  NodeCell  take  an  extra  argument  p  that 
indicates  which  leaf  or  node  values  are  used. 
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z  1  Z2 


5  A  B 


(a)  LeafCell  Circuit  (b)  ®  Circuit 


(VVOeaffoT)  -4  LeafCel \{AyB,S,FuF2,EuE2,p))  A 
(nod e(p,T)  -4  NodeCell(Fi,F2,Fi,F2,p)))  A 
shape_cond(A,  B,  5, T) . 

RootCell  initializes  the  carry-in  bit  and  computes  the 
carry-out  bit.  The  next  two  lines  correspond  to  a  for- 
loop  with  discrimination  (pattern  matching)  for  each 
position  p.  The  first  implication  gives  the  base  case: 
each  leaf  of  the  circuit  is  LeafCell.  The  second  gives 
the  step  case:  each  inner  node  of  the  circuit  is  NodeCell. 
The  predicate  shape.cond  fixes  the  shape  of  the  inputs; 
we  explain  it  shortly. 

5.3  Behavior 


Fx  F2  £,  £2 


(c)  RootCell  Circuit 


(d)  NodeCell  Circuit  (op) 


Figure  4.  Components  of  the  Cla  Circuit 


LeafCelI(A,B,5,  Fi,F2,  Fi,F2,p)  = 

and(A(p),B(p),Fi(p))  A  or(A(p),B(p), E2(p))  A 
30wi,iu2.and(Fi(p),F2(p),u;i))  A 

xor  (A(p),B(p),w2)  A  xor(wi, w2, S(p)) 


NodeCell  (Fi,  F2,  Fi,  F2,p)  = 

(Fi(p.O)  44  Fi(p))  A  (F2(p.O)  44  F2(p))  A 
op(Fi  (p) ,  F2  (p) ,  Ei (p.O) ,  E2  (p.O) ,  Fi  (p.l) ,  F2  (p.O))  A 
op(Fi  (p.O) ,  E2  (p.O) ,  Fi(p.l),  B2(p.l),  Fi(p),  F2(p)) 

RootCell(Fi,F2,Fi,F2,cm,cou*)  = 

(cm  44  Fi(c))  A  (cm  44  F2(e))  A 
(3 °a,  6.  op(Fi (5),  F2(c), Fi(£),  F2(e), a,  6)  A 
and(a,6,  cout)) 


op(a?i,X2,yi,y2,zi,^2)  =  30iui,W2)iu3,W4,u>5. 
not(xi,wi)  A  and(yi,y2,tU2)  A  and(z2j  y2,  W3)  A 
and(tui,  w2,wt)  A  and(xi,tt;3) ^5)  A 
or(w4,w$,zi)  A  or(w4ywsyZ2) 


In  WS1S  we  encoded  numbers  as  bit-strings.  In 
WS2S  we  have  binary  trees  and  encode  numbers  using 
the  labels  on  a  tree’s  frontier.  For  example,  the  follow¬ 
ing  trees  represent  the  bit-strings  10011  and  11001. 


An  important  requirement  in  specifying  tree  struc¬ 
tured  adders,  is  that,  once  we  fix  the  format  of  the 
adder  to  be  a  particular  shape,  both  inputs  and  the 
output  must  also  be  of  that  shape;  that  is,  numbers 
must  be  encoded  at  leaf  nodes  in  the  same  positions  in 
these  three  trees.  This  kind  of  requirement,  which  cor¬ 
responds  roughly  to  a  kind  of  ‘type-correctness’  for  the 
inputs,  is  easy  to  specify  in  a  high-level  programming  or 
specification  language  where  recursive  data-types  can 
formalize  this  type  (or  ‘shape’)  constraint.  In  our  set¬ 
ting,  we  define  a  predicate  (shape.cond,  which  we  used 
in  the  specification  of  cla  above)  that  enforces  this  re¬ 
quirement. 

We  proceed  in  several  steps.  First,  we  characterize 
those  trees  with  a  particular  shape  as  those  T  where: 

1.  T  is  not  emtpy. 

2.  T  is  closed  under  the  parent  relation:  if  a  position 
p  is  in  T  then  its  parent,  denoted  by  pf,  is  in  also 
in  T. 

3.  If  an  inner  node  p  is  in  T  then  its  both  successors 
p.O  and  p.l  are  in  T  too. 


The  overall  circuit,  given  in  Figure  3,  is  modeled  by 
the  predicate 

c\a(A,  B,S>  Ei,  E2yFi,F2,  tin,  cout)  =  3  2T. 

RootCell(Fi, F2,Fi,F2, cm, coui)  A 


This  ‘type’  is  formalized  by 
shape(T)  =  (3 1p.T(p))  A 

VV(T(p)4T(pt))  A  (T(p)  A  T(p.O)  -4  T(p.l))  A 
(T(P)  A  T(p.l)  -4  T(p.O)). 
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Second,  we  say  that  a  tree  X  has  shape  T  ( X  is  of  type 
T),  if  all  its  positions  constitutes  a  subset  of  the  leafs 
of  T. 

is-of-shap e(X,T)  =  Vxp.X(p)  ->  leaf (p,T). 

Finally,  we  combine  these  to  formalize  shape_cond 
that  holds  when  the  trees  A ,  and  S,  which  represent 
base-two  numbers,  have  the  same  shape  T. 

shape_cond(A,  B,  S,  T)  =  shape(T)  A  is_of_shape(A,T)  A 
is_of_shape(B,T)  A  is_of_shape(5,T). 

To  complete  our  behavioral  model,  we  define  sev¬ 
eral  auxiliary  predicates  in  Figure  5  that  allow  us  to 
traverse  the  leaves  of  a  valid  input  tree  from  left  to 
right.  For  a  tree  T  satisfying  the  shape  predicate  and 
a  position  p ,  first  (p,  T)  checks  if  p  is  the  left-most  leaf  in 
T  and  last(p,  T)  checks  if  p  is  the  right-most  leaf  in  T. 
The  predicate  next(p,  q,  T)  checks  if  p  and  q  are  leaves 
in  T  and  q  is  the  next  leaf  to  the  right  of  q.  Using  these, 
our  behavioral  model  can  be  defined  analogously  to  the 
first  behavioral  model  in  Section  4. 

adder  _beh(A,  B ,  5,  cm,  cout)  =  32T,  C. 
shape_cond(A,  B,  S,T)  A 

Vxp.  leaf  (p,  T)  ->*  modjtwo(A(p),B(p),C(p),5(p))  A 
3xp.first(p,T)  A  (tin ++ C(p))  A 
3lp.  last(p,T)  A  atJeastJtwo(A(p),B(p),C(p),couU  A 
Vxp,g.  leaf  (p,T)  A  next (p,q,T) 

atJeast-two (A(p),B(p),  C(p),C(g))  . 

5.4  Verification 

We  can  now  verify  that  our  structural  model  of  the 
Cl  A  is  equivalent  to  our  behavioral  model,  i.e.,  that  a 
Cla  of  any  size  actually  adds  its  inputs.  We  formalize 
this  as 

V2A,  1 9,  S.  V°cm,  cout. 

c\a(A ,  B, 5,  tin ,  cout)  adder_beh(A, B ,  5, tin , cout ) . 

This  formula  is  proved  by  MONA  in  one  second. 

6  Conclusion  and  Prospects 

Monadic  logics  have  been  studied  by  logicians  for 
over  forty  years,  but  are  almost  unknown  to  engineers. 
We  have  argued  that  these  logics  are  natural  general¬ 
izations  of  Boolean  logic  that  can  be  motivated  from  an 
engineering  perspective  and  used  to  model  many  more 
kinds  of  systems.  They  are  decidable,  which  means 
that  verification  can  be  automated,  and  they  have  an 


leaf (p,T)  =  T(p)  A  -T(p.O)  A  -T(p.  1) 

no de(p,T)=T(p)  A  T(p.O)  A  T(p.l) 

path(p,X)  =  leaf(p,  X)  A 

^x.(X(x)^X(x^))  A  (X(x.0)->-*X(x.l))A 
(X(x.l)  ->  -nX(x.O)) 

next(p,g,T)  =  p  ^  q  A  leaf (p,T)  A  leaf (7,  T)  A 
32P,  Q.  path(p,  P)  A  path(g,  Q)  A 
3ls.  32S.  P(s)  A  Q(s)  A  P(s.O)  A 
<2(s.l)  A  path(s,5)  A 
Vxii.  ( P(u.O )  A  u  7^  s  — S(u.O))  A 
(Q(u.l)  A  ii  ^  s  — 5(u.l)) 

first(p,T)  =  leaf (p,T)  A  32X.  path(p,X)  A 
->  <  v  -«T(v)) 

last(p,  T)  =  leaf (p,T)  A  32X.  path(p,  X)  A 
V1t*.X(ti.O)  (Vxt;.  u.l  <v^  ->T(v)) 

Figure  5.  Auxiliary  predicates  for  the  Cla 


operational  side  as  formulae  correspond  to  automata 
over  strings  or  trees.  This  opens  possibilities  for  de¬ 
signing  modeling  environments  that  support  a  variety 
of  formal  activities  including  verification,  simulation, 
and  testing. 

One  drawback  is  that,  in  some  cases,  problems  must 
be  modeled  at  too  low  a  level  of  abstraction:  in  WS2S 
everything  must  be  mapped  onto  binary  branching 
Boolean  trees.  This  can  make  models  difficult  to  de¬ 
sign  and  understand  (witness  the  concern  with  ‘shape 
constraints’  in  Section  4.2)  and  lead  to  modeling  errors. 

Our  current  work  aims  at  building  a  high-level  spec¬ 
ification  language  that  eliminates  this  drawback.  We 
have  developed  a  language  that  allows  users  to  spec¬ 
ify  tree  languages  in  WS2S  using  data-types  like  those 
found  in  modern  programming  languages  [2].  More¬ 
over,  we  are  building  a  translator  from  the  high-level 
hardware  description  language  VHDL  to  WS1S  and 
integrating  the  MONA  system  with  other  development 
and  verification  tools  [3]. 
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Abstract 

EXOR  Ternary  Decision  Diagrams  (EXOR-TDDs)  proved 
useful  in  several  applications  in  logic  design.  This  paper 
shows  that  EXOR-TDDs  are  also  useful  in  functional  de¬ 
composition.  We  prove  that  checking  simple  disjoint  de¬ 
composition  of  f  with  respect  to  a  subset  of  r  variables 
is  equivalent  to  building  the  EXOR-TDD  for  a  matrix¬ 
valued  function  of  (n  —  r)  variables.  Transferring  the  prob¬ 
lem  of  decomposition  into  the  problem  of  building  EXOR- 
TDDs  permits  formulation  of  a  simple  decomposition  pro¬ 
cedure.  The  same  method  applies  to  simple  disjoint  bi¬ 
decomposition  of  switching  functions. 

1.  Introduction 

Decomposition  of  switching  functions  is  an  important  task, 
since  when  decomposition  is  possible,  it  leads  to  many  ad¬ 
vantages  in  network  synthesis  [10],  [12].  At  the  same  time, 
this  is  a  difficult  task.  For  example,  simple  disjoint  decom¬ 
position  is  a  restricted  class  of  functional  decompositions 
defined  as  follows. 

Definition  1:  An  n-variable  switching  function  /  is 
simple  disjoint  decomposable  with  respect  to  the  set  of  first 
r  variables  iff  it  can  be  represented  as 

f{x i , . . . ,  xn)  =  g{h(x i , . . . ,  *r),  xr+i  (1) 

where  g  and  h  are  switching  functions  of  r  and  (n  -  r) 
variables,  respectively. 

For  a  given  function,  to  disclose  even  such  decomposition 
is  both  time  and  space  consuming. 

The  relation  (1)  expresses  decomposition  with  respect  to 
the  first  r  variables  in  /.  We  assume  that  we  have  developed 
a  procedure  to  check  and  perform  such  decomposition  for 
a  given  /,  if  the  decomposition  is  possible. 

The  same  decomposition  procedure  can  be  used  to  check 
decomposability  with  respect  to  any  other  subset  of  r  vari¬ 
ables  xju ..  .,Xjr,  if  instead  /  we  consider  an  auxiliary 
function  /'  derived  from  /  by  the  permutation  of  variables 
Xj i  , .  ♦  •  >  Xjr  and  x\ , . . . ,  xr . 

Decomposition  of  a  given  /  is  possible  iff  there  are  some 
relationships  among  the  values  of  /.  In  general,  these  re¬ 
lationships  reduce  to  the  requirement  that  there  are  some 
isomorphic  parts  in  /.  For  a  chosen  representation  for  /, 
these  isomorphic  parts  may  be  properly  defined  and  spe¬ 
cified.  For  example,  if  decomposition  is  performed  through 
a  decomposition  chart,  we  should  check  equality  or  other¬ 
wise  of  some  columns,  respectively  rows,  in  the  chart  [2].  If 
/  is  given  by  some  analytic  expressions  as  Positive  Polarity 


Reed-Muller  expressions,  or  Sum-of- Products  expressions, 
we  should  check  whether  literals  in  product  terms  belong  to 
some  predetermined  subsets  of  variables  [15].  If  /  is  given 
by  the  truth- vector,  we  should  check  equality  of  some  sub¬ 
vectors  in  it.  If  decomposition  of  /  is  performed  through 
a  decision  diagram  for  /,  we  should  check  equality  of  some 
subtrees  in  the  decision  diagram  [3],  [10]. 

There  are  many  decomposition  methods  for  different 
definitions  of  the  decompositions  proposed  in  the  literature 
from  the  first  considerations  of  the  decomposability  [2] ,  up 
to  the  related  recent  publications  [3],  [4],  [7],  [8],  [9],  [10], 
[13],  [15],  [17].  Different  methods  are  based  on  different 
representations  of  switching  functions,  attempting  to  take 
advantages  from  peculiar  properties  of  these,  possibly  re¬ 
duced  or  compact,  representations.  However,  hardly  any  of 
these  methods  exceeds  exhaustive  search  for,  in  some  sense, 
isomorphic  or  equal  parts  in  some  representation  for  /. 

Present  decomposition  methods  start  from  the  existing 
representations  for  switching  functions.  These  represent¬ 
ations  are  intended  to  provide  as  compact  representations 
for  /  as  possible,  or  are  adapted  to  some  particular  applic¬ 
ations.  The  decomposition  methods  are  directed  towards 
algorithms  for  efficient  manipulations  with  these  represent¬ 
ations  to  check  decomposability. 

A  proper  solution  of  the  decomposition  problem  would 
be  to  define  purposely  a  data  structure  for  representation 
of  switching  functions,  where  criteria  for  decomposability 
in  a  given  form  would  be  apparent.  However,  this  is  hardly 
possible,  at  least  it  is  not  easy  within  the  algebraic  struc¬ 
tures  usually  used  in  switching  theory.  A  reason  is  that 
in  these  structures,  relationships  among  /  and  decompos¬ 
ition  parts  h  and  g  of  /,  are  complicated.  Therefore,  it 
is  not  easy  to  define  a  data  structure  which  at  the  same 
time  ensure  a  compact  representation  for  /  and  where  the 
decomposability  criterion  is  simply  expressed. 

In  this  paper,  we  are  trying  to  follow  a  way  between  these 
two  opposite  requirements.  We  have  looked  over  existing 
data  structures  for  representation  of  switching  functions  for 
that  where  conditions  for  decomposition  of  the  form  (1)  are 
simply  expressed.  We  have  noted  that  EXOR  ternary  de¬ 
cision  diagrams  (EXOR-TDDs)  [11]  are  such  a  data  struc¬ 
ture  for  the  following  reasons. 

EXOR-TDDs  are  introduced  for  minimization  of  AND- 
EXOR  expressions  [11].  In  spectral  transforms  interpret¬ 
ation  of  DDs  [19],  EXOR-TDDs  are  graphical  representa¬ 
tions  of  function  expansions  in  terms  of  an  extended  basis 
consisting  of  3n  functions  [20].  Thanks  to  the  extension 
of  the  basis,  the  information  content  of  an  EXOR-TDD 
is  considerably  raised.  Thus,  in  EXOR-TDDs,  we  sac- 
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rifice  in  part  compactness,  and  the  reword  is  highly  in¬ 
creased  amount  of  information  contained  in  the  EXOR- 
TDDs.  We  have  pointed  out  that  in  EXOR-TDDs,  the 
conditions  for  decomposability  of  the  form  (1)  are  inher¬ 
ently  contained,  since  they  have  been  already  checked  dur¬ 
ing  we  build  EXOR-TDD  for  /.  That  permits  formulation 
of  a  simple  procedure  to  find  decomposition  of  the  form  (1) 
through  EXOR-TDDs.  The  same  method  applies  to  detec¬ 
tion  of  simple  disjoint  bi-decomposition  [15].  Separation 
into  the  necessary  and  the  sufficient  conditions,  saves  many 
computations  in  the  case  when  the  required  decomposition 
or  bi-decomposition  is  impossible. 

2.  Matrix- Valued  DDs 

2.1  Matrix-valued  MTBDDs 

Multi-Terminal  Binary  Decision  Diagrams  (MTBDDs)  [6], 
are  a  generalization  of  Binary  Decision  Diagrams  (BDDs) 
[5]  derived  by  allowing  integers  or  complex  numbers  as  val¬ 
ues  of  constant  nodes.  Therefore,  MTBDDs  represents  in¬ 
teger  or  complex- valued  functions  on  finite  dyadic  groups. 
Generalization  to  functions  on  arbitrary  groups  is  straight¬ 
forward.  In  this  case,  the  nodes  in  MTBDDs  are  replaced 
by  the  nodes  with  more  than  two  outgoing  edges. 

Application  of  non- Abelian  algebraic  structures  in  engin¬ 
eering  practice  [21],  implies  consideration  of  matrix- valued 
functions. 

Definition  2:  A  matrix- valued  function  is  a  function 
defined  as  a  mapping  f  :  G  -¥  Ma, 6,  where  G  is  a  finite 
not  necessarily  Abelian  group,  and  Ma,b  is  the  set  of  (a  x  b) 
matrices  whose  entries  take  values  in  a  field  P  that  may  be 
the  complex  field  Cora  finite  (Galois)  field  GF(p). 

We  denote  by  Pa,fc(G)  the  space  of  matrix-valued  func¬ 
tions  on  G  whose  values  are  (a  x  b)  matrices  with  entries 
in  P.  To  represent  functions  in  Pa,6(G),  the  matrix- valued 
Decision  Diagrams  are  introduced  [18]. 

Definition  3:  The  matrix-valued  Decision  Diagrams 
(mvDDs)  are  DDs  whose  constant  nodes  take  values  in 

If  G  is  the  finite  dyadic  group,  and  P  =  C,  then 
mvDDs  in  Definition  3  become  the  matrix-valued  MTBDDs 
(mvMTBDDs).  If  Afa>i,  and  P  =  GF{ 2),  mvMTBDDs  be¬ 
come  MTBDD  used  in  [14].  A  mvMTBDD  is  shown  in  Fig. 
2  in  Example  1. 

2.2  Matrix-valued  EXOR-TDDs 

Ternary  Decision  Diagrams  (TDDs)  are  a  generalization  of 
BDDs  derived  by  allowing  the  third  outgoing  edge  for  each 
node  [10],  [11].  As  in  BDDs,  the  first  two  outgoing  edges 
of  a  node  at  the  i-th  level  in  TDDs  point  to  the  values 
/0  =  f(xi  =  0),  and  f\  =  /(x,-  =  1).  The  third  edge  points 
to  /2  —  /o  *  fiy  where  *  denotes  an  operation  over  logic 
values  0,1,  and  u,  with  u  denoting  unspecified  logic  value. 

Definition  4:  [11]  EXOR-TDDs  are  TDDs  where  *  is  the 
addition  in  GF( 2),  usually  denoted  as  logic  EXOR. 

EXOR-TDDs  are  derived  by  the  reduction  of  EXOR- 
Ternary  decision  trees  (EXOR-TDTs)  [11].  In  an  EXOR- 
TDT,  the  constant  nodes  represent  logic  values  0  or  1.  For 


a  given  /,  these  values  are  determined  a  elements  of  the 
extended  truth- vector  Fe  for  /.  As  shown  in  [20],  Fe  is  the 
spectrum  for  /  with  respect  to  the  Extended  Reed-Muller 
(ERM)  transform. 

Definition  5:  Fe  for  an  n- variable  switching  func¬ 
tion  f(x i,...,£n)  given  by  the  truth- vector  F  = 
[/( 0), . . . ,  f(2n  -  1)]T  is  defined  by 

n  [10‘ 

Fe  =  E(n)F,  E(n)  =  0E(l),  E(l)  =  0  1  , 

i=i  1  1  _ 

where  E(n)  is  the  ERM-transform  matrix.  Calculations  are 
performed  in  GF(2). 

From  this  spectral  interpretation,  if  /  is  given  by  a  BDD, 
the  EXOR-TDD  can  be  derived  by  performing  calculations 
determined  by  E(l)  at  each  node  in  the  BDD  [20].  Similar 
to  the  generalization  of  BDDs  into  MTBDDs,  and  further, 
into  mvMTBDDs,  we  introduce  the  matrix- valued  EXOR- 
TDDs  (mvEXOR-TDDs). 

Definition  6:  Matrix-valued  EXOR-TDDs  are  a  gener¬ 
alization  of  EXOR-TDDs  derived  by  allowing  the  matrices 
over  GF(2)  as  values  of  constant  nodes. 

Fig.  3  shows  mvEXOR-TDT  for  /  in  Example  1,  and 
Fig.  4  shows  the  corresponding  mvEXOR-TDD.  The  first 
two  outgoing  edges  in  a  node  are  labeled  by  j/,-  and  yi ,  re¬ 
spectively.  The  third  edge  points  to  /o  ©  /i-  Thus,  it  is 
denoted  by  1,  which  resembles  that  y{  0  yi  =  1. 

3.  Decomposition  through  mvEXOR-TDDs 

The  Ashenhurst  decomposition  theorem  [2],  can  be  writ¬ 
ten  in  terms  of  the  truth- vector  for  /  as  follows. 

Theorem  1:  For  an  n- variable  switching  function  /,  we 
split  the  truth-vector  F  =  [/( 0), . . . ,  /(2n  —  1)]T  into  2r  sub¬ 
vectors  of  2n~r  successive  function  values.  Thus,  F  is  writ¬ 
ten  as  a  matrix-valued  vector  Fm  =  [Fo, . . . ,  F2»-r_1]T. 
The  function  /  is  simple  disjoint  decomposable  with  re¬ 
spect  to  the  first  r  variables  iff  in  Fm  there  are  at  most  two 
different  subvectors  F*. 

In  this  case,  to  each  subvector  we  assign  a  value  j  6 
GF(2),  providing  that  the  same  value  is  assigned  to  the 
equal  vectors.  These  values  are  considered  as  elements  of 
a  vector  H  of  order  2r.  The  vector  H  thus  generated  is 
the  truth- vector  for  h  in  (1).  The  truth- vector  of  g  is  G  = 
[Fio,Fji]Tj  where  F,j  is  the  subvector  to  which  the  value 
j  £  GF( 2)  is  assigned. 

The  same  theorem  can  be  written  in  terms  of  mvEXOR- 
TDDs  as  follows. 

We  assign  to  /  a  matrix- valued  function  fm  defined  in 
2r  points.  fm  is  given  by  Fm  as  defined  in  Theorem  1. 
A  mvMTBDD  for  fm  can  be  easily  derived  from  BDD  for 
/,  by  simply  cutting  the  last  r  non-terminal  nodes  in  the 
BDD.  Subfunctions  represented  by  subtrees  rooted  at  the 
nodes  at  the  (r  —  l)st  level  in  the  BDD  for  /  are  used  as 
values  of  constant  nodes  in  the  mvMTBDD  for  fm * 

We  derive  mvEXOR-TDD  for  fm  by  processing  nodes 
in  the  mvMTBDD  for  fm  by  the  rule  described  by  E(l), 
in  the  same  way  as  we  derive  EXOR-TDD  for  /  from  the 
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BDD  for  /.  Note  that  mvEXOR-TDD  for  /  is  EXOR-TDD 
for  fm  which  is  assigned  to  /. 

Let  X  and  Y  be  two  arbitrary  vectors  of  order  2n~r  over 
GF( 2),  and  0  the  zero- vector  of  the  same  order.  Let  R  = 
{X,  Y,  X  0  Y,  0},  where  0  is  the  componentwise  EXOR. 

Theorem  2:  A  switching  function  /  is  simple  disjoint  de¬ 
composable  with  respect  to  a  subset  of  r  variables  iff  the 
matrix-valued  EXOR-TDD  for  fm  assigned  to  /  has  at 
most  four  constant  nodes  whose  values  are  in  R . 

Proof.  The  proof  follows  from  Theorem  1  and  definition 
of  EXOR-TDDs.  From  this  theorem,  to  check  decompos- 
ability,  we  have  to  compare  all  the  subvectors  F:  to  see  how 
many  different  there  are.  We  can  do  that  by  the  compon¬ 
entwise  EXOR  of  the  subvectors  F,  .  From  transitivity,  we 
do  not  need  to  form  EXOR  of  all  the  subvectors.  Some 
EXOR  sums  are  covered  by  the  existence  of  some  others. 
From  definition  of  EXOR-TDDs,  we  calculate  for  the  third 
outgoing  edges  all  the  required  EXOR  sums  which  are  ne¬ 
cessary  to  check  the  decomposability.  Q.E.D. 

In  mvEXOR-TDD  for  /,  h  in  (1)  is  determined  by  labels 
at  the  edges  in  the  paths  from  the  root  node  to  the  constant 
nodes  X  and  Y.  Thus,  there  are  two  choices  for  h.  We 
denote  them  by  hx  and  hy .  The  function  hx  is  determined 
as  the  logic  sum  of  labels  at  the  edges  in  the  paths  from  the 
root  node  to  the  constant  node  X.  hy  is  determined  in  the 
same  way  with  respect  to  the  paths  to  the  constant  node 
Y. 

The  function  g  in  (1)  is  determined  by  the  values  of  con¬ 
stant  nodes  X  and  Y,  respectively,  depending  on  the  chosen 
h}  hx  or  hy.  Thus,  gx  corresponding  to  hx  is  given  by 
the  truth-vector  G*  =  [X,  Y]T.  Similar,  the  truth-vector 
for  gy  is  Gy  =  [Y,  X]r. 

3.1  Discussion 

The  simple  disjoint  decomposability  permits  at  most  two 
different  subvectors  in  Fm.  These  subvectors  produce  iso¬ 
morphic  subtrees  in  the  BDD  for  /. 

From  definition  of  EMR-transform,  it  may  be  at  most 
four  different  subvectors  in  Fe.  They  are  defined  as  in  R  in 
Theorem  2. 

By  building  mvEXOR-TDD  for  fm  assigned  to  /,  we 
actually  perform  comparison  of  subtrees  in  the  BDD  for  /. 
From  definition  of  EXOR-TDDs,  comparison  is  transferred 
into  comparison  of  constant  nodes  in  the  mvEXOR-TDD. 
Thus,  it  is  done  automatically  and  it  is  simplified. 

Complexity  of  BDDs,  expressed  through  the  number  of 
non-terminal  nodes,  approximates  to  0(2n/n)  [16].  Com¬ 
plexity  of  EXOR-TDDs  approximates  to  0(3n/n)  [11]. 
From  that  and  the  previous  considerations,  we  have  the  fol¬ 
lowing  lemma. 

Lemma  1:  Complexity  of  mvEXOR-TDDs  for  functions 
which  are  simple  disjoint  decomposable  with  respect  to  a 
subset  of  r  variables  approximates  0(3n_r/ (n-r)-|-3-2r/r), 
if  we  use  a  good  ordering  of  the  input  variables.  If  we  do 
not  know  the  order,  the  complexity  is  at  most  0(3n/n). 

Proof.  We  build  EXOR-TDD  for  fm  which  has  (n  -  r) 
variables.  There  are  three  constant  nodes,  that  are  (r  x 
1)  matrices.  They  can  be  represented  by  BDDs,  whose 


complexity  is  0(2r  /r).  The  node  0,  can  be  replaced  by  the 
number-valued  node  0.  Thus,  the  Lemma  follows.  Q.E.D. 

For  notation  convenience,  the  subvectors  F,  in  Fm  may 
be  written  as  (p  x  q)  matrices,  with  appropriately  chosen 
values  p  and  q  with  p  +  q  =  2n~~r .  In  this  notation,  the  p 
successive  values  of  F,  are  written  as  a  column  or  row  of  a 
(p  x  g)  matrix. 

The  condition  for  decomposability,  expressed  in  terms  of 
mvEXOR-TDDs,  can  be  split  into  the  necessary  and  suffi¬ 
cient  parts.  The  necessary  condition  is  that  mvEXOR-TDD 
for  a  simple  disjoint  decomposable  function  has  at  most  four 
constant  nodes.  The  sufficient  condition  is  that  the  values 
of  these  nodes  are  in  R  defined  in  Theorem  2. 

For  large  values  of  r,  matrix-valued  constant  nodes  in 
mvEXOR-TDD  can  be  represented  by  BDDs.  Fig.  5  shows 
structure  of  such  a  mvEXOR-TDD.  Thus,  mvEXOR-TDD 
for  /  is  EXOR-TDD  where  the  constant  nodes  are  BDDs 
for  F i  in  F  [18]. 

4.  Procedure 

Theorem  2  permits  to  formulate  a  procedure  for  simple 
disjoint  decomposition  with  respect  to  a  subset  of  first  r 
variables  in  /  through  mvEXOR-TDDs. 

Denote  by  nc  the  number  of  constant  nodes  in  the 
mvEXOR-TDD  for  fm  assigned  to  /.  Denote  by  c;,  i  = 
1, . .  ,nc  the  values  of  these  nodes.  The  set  R  is  as  defined 
in  Theorem  2. 


Decomposition  Procedure 

1.  Given  an  n- variable  function  /.  Assign  fm  to  /  for 
the  required  r. 

2.  Build  up  mvEXOR-TDD  for  fm. 

3.  Check  the  number  of  constant  nodes  in  mvEXOR- 
TDD  from  step  2.  If  nc  >  4,  the  decomposition  is 
impossible.  If  nc  <  4,  check  if  c,  €  R .  If  yes,  the 
decomposition  is  possible. 

Example  1:  Table  I  shows  a  function  of  five  variables, 
which  is  used  in  Examples  11.2.2  and  11.3.1  in  [10].  We 
consider  simple  disjoint  decomposition  of  this  function  with 
respect  to  the  first  r  =  3  variables. 

We  assign  to  /  the  matrix-valued  function  fm  given  by 
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Fig.  1  shows  BDD  for  /,  and  Fig.  2  shows  mvMTBDD 
for  /.  Fig.  3  shows  mvEXOR-TDT  for  /m.  In  this  figure, 


Fig.  4  shows  mvEXOR-TDD  for  /m.  This  mvEXOR- 
TDD  has  four  constant  nodes.  Thus,  the  necessary  condi¬ 
tion  for  decomposition  of  the  form  (1)  is  satisfied.  If  the 
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TABLE  I 

Function  /  in  Example  1. 

[  X3  ,  X4  ,  X5 
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first  two  constant  nodes  are  denoted  by  X  and  Y  as  above, 
the  third  node  is  X0  Y,  while  the  fourth  node  is  0.  Thus,  / 
has  decomposition  of  the  form  (1)  with  respect  to  jci,  #2,  x3, 
and  can  be  written  as  /  =  g(h(x\ ,  ,  x$),  £4,  #5). 

We  follow  the  paths  from  the  root  node  to  the  constant 
nodes  X  and  Y.  The  products  of  labels  in  these  paths 
determine 

hx  -  V1V2V3  +  ViVi  + 

hy  =  yM  +  ymys  +  y^Vs' 

The  values  of  constant  nodes  determine  the  corresponding 
gx  and  gy  as  follows 

G*  =  [[J  J],[S  J]]T  =  [1,0, 0,1, 0,0,1, or, 

Gy  =  [[J  }],[;  S]]  =[0,0, 1,0, 1,0,0, if. 

5.  Bi-Decomposition  through  mvEXOR- 

TDDs 

Simple  disjoint  bi-decomposition  is  another  restricted  class 


Fig.  2.  mvMTBDD  for  /m  in  Example  1. 
TABLE  II 

Function  /  in  Example  2. 

I  X3 ,  X4 ,  X5 
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of  functional  decomposition  [15].  As  shown  in  [15],  it  is 
useful  in  some  applications,  for  example  in  realizations  with 
programmable  devices  having  two-input  circuits  at  the  out¬ 
puts. 

Definition  7:  An  n- variable  switching  function  /  is  simple 
disjoint  bi-decomposable  iff  it  can  be  represented  in  the 
form 

/(ari,. .  .,xn)  =  g{hi(xiy . .  .,^r),h2(xr+i, . .  .,xn)). 
Expressed  in  terms  of  decomposition  charts,  /  is  simple 
disjoint  bi-decomposable  iff  there  are  at  most  two  distinct 
columns  and  rows,  at  the  same  time.  Expressed  in  terms 
of  the  truth- vectors,  it  is  required  that  there  are  at  most 
two  different  subvectors  of  order  2r  in  the  truth- vectors  for 
f{xl  1  •  •  • ,  Xr  j  #r+l  j  *  •  *  1  ®n)  &nd  f{xr+\  ,  .  .  . ,  Xnj  X\,  ,  .  . ,  £r). 

In  terms  of  EXOR-TDDs,  the  conditions  for  simple  disjoint 
bi-decomposition  can  be  expressed  as  follows. 

Theorem  3:  An  n-variable  switching  function  /  is  simple 
disjoint  bi-decomposable,  iff  the  set  R  in  Theorem  2  is  R  — 
{X,Y,1,0J  ,  where  1  and  0  are  the  constant  matrices  whose 
all  the  elements  are  equal  to  1  and  0,  respectively. 

Proof:  The  proof  directly  follows  from  Theorem  2  and 
definition  of  simple  disjoint  bi-decomposition  for  /. 
Example  2:  Consider  a  function  /  given  in  Table  II,  used  in 
Example  5.1  in  [15].  Assume  that  we  want  to  check  EXOR 
bi-decomposability  with  respect  to  the  subsets  of  variables 


157 


/ 


Fig.  3.  mvEXOR-TDT  for  fm  in  Example  1. 


X\  =  and  X2  =  {tf3j  x4}.  Thus,  we  want  to 

represent  /  as  /  =  gi{Xx)  (&g2(X2).  To  make  the  example 
obvious,  we  write  /  as  a  positive  polarity  Reed-Muller 
(PPRM)  expression  f  —  x  1  ©  xxx2  ©  3:33:4  ©  xix2x^x6. 
The  method  derived  from  Theorem  3  is  formulated 
for  decomposition  with  respect  to  the  first  r  variables 
in  /.  Therefore,  we  determine  an  auxiliary  function 
f  (*^3>  ®4>  X5,  *^6)  =  f  {%1 1  X2y  X3y  X4}  X3l  Xq} .  Thus, 

f  =  x3  ©  X3X4  ©  X1X2  ©  X3X4X5X6.  We  associate 
to  f  a  matrix- valued  function  fm.  Elements  of  ffm 
are  matrices  of  four  successive  values  of  /'.  Thus, 

ffm  is  given  by  the  matrix-valued  truth-vector  = 
[0,0,1,X,0,0,1,X,0,0,1,X,1,1,0,Y]T,  where  X  = 


To  check  the  required  bi-decomposability,  we  should  de¬ 
termine  how  many  different  subvectors  there  are  in  .  In 
that  order,  we  should  mutually  compare  all  the  elements  in 
FJ„ .  In  simple  examples,  as  that  we  are  considering,  this  is 
obvious.  However,  in  functions  with  many  variables,  that 
is  a  time  consuming  task,  since  the  number  of  comparisons 
is  large.  In  the  method  we  are  proposing,  we  perform  these 
comparisons  by  building  mvEXOR-TDD  for  F^ . 

Fig.  6  shows  mvEXOR-TDD  for  f'm .  The  constant  nodes 
in  this  mvEXOR-TDD  may  be  denoted  as  X,  Y,  1  and 
0,  respectively.  Thus,  they  belong  to  R  as  defined  in  The¬ 
orem  3,  and  simple  disjoint  bi-decomposition  of  /'  with 
respect  to  the  subsets  of  variables  xXyx2  and  3?3,X4,x5,x6 
is  possible.  It  follows  that  the  same  decomposition  for  / 
is  possible  with  respect  to  the  subsets  of  variables  X\  and 
X2.  Thus,  /  can  be  written  as  /  =  gi(Xx)  g2(X2). 

If  the  node  X  in  the  mvEXOR-TDD  in  Fig.  6  is  taken  as  the 
truth-vector  for  g2,  then  G2  =  [0,0,0, 1]T.  This  is  a  two- 
variable  function  g2  =  ziz2y  or  in  terms  of  variables  in  X2, 
g2  =  X3X4.  We  determine  gx  in  the  required  decomposition 
by  following  the  paths  from  the  root  node  to  the  node  X  in 
the  mvEXOR-TDD  for  /^.  From  Fig.  6  we  determine 

g'l  =  Vi 3/2^33/4  ®  V1V2S/4  ®  2/i  2/22/32/4  ©  Vi 2/22/4 
©2/12/22/32/4  ®  2/12/22/4  ©  2/12/22/4- 


Fig.  5.  Structure  of  mvEXOR-TDTs  with  BDTs  as  constant  nodes. 
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Fig.  6.  mvEXOR-TDD  for  in  Example  2. 

In  terms  of  the  variables  in 

g[  =  X1X2X5X6  0  x{x2xe  0  X1X2X5XQ  0  x\x2xq 
®X\X2XsXs  0  X\X2Xq  0  X\X2Xq. 

We  derive  g\  in  bi-decomposition  for  /  by  performing  on  g[ 
the  permutation  inverse  to  that  used  to  determine  /'  from  /. 
We  pay  attention  that  the  group  of  permutations  is  a  non- 
Abelian  group,  which  determines  the  order  of  permutations 
we  should  perform.  It  follows  that,  in  this  example,  we 
should  permute  x\  x&  and  x2  <->  X5.  Thus, 

gi  =  X6X5X2X1  0  x6x5xi  0  x6x^x2xi  0  x6x5xi 
0^6X5X20:1  0  X6X5X1  ©  X6X5X1. 

After  reduction,  we  get  g\  =  xi  ©tfjx^  0  X1X2X5X6,  which 
is  obvious  from  PPRM  for  /,  The  same  result  is  derived 
in  [15]  by  the  analysis  of  PPRM  for  /. 

6.  Closing  Remarks 

We  considered  simple  disjoint  decomposition  and  simple 
disjoint  bi- decomposition  of  switching  functions  through 
EXOR-TDDs.  It  is  shown  that  to  check  decomposability  or 
bi-decomposability  with  respect  to  r  variables  is  equivalent 
to  build  EXOR-TDD  for  a  matrix- valued  function  of  (n-r) 
variables.  In  that  order,  the  matrix-valued  EXOR-TDDs 
are  introduced.  In  practical  realizations,  these  TDDs  are 
represented  as  EXOR-TDDs  with  BDDs  as  constant  nodes. 
In  practice,  we  do  not  always  look  for  the  fastest  methods, 
using  possibly  quite  specialized  tools.  We  may  not  always 
look  for  methods  and  tools  that  can  handle  the  most  com¬ 
plex  problems.  Sometimes,  we  are  quite  satisfied  if  we  can 
solve  a  few  tasks  with  a  single  tool  that  we  have.  For  that 
case,  we  recommend  EXOR-TDDs. 


Decomposition  and  bi-decomposition  are  two  more  tasks, 
which,  together  with  several  other  tasks  in  logic  design,  can 
be  efficiently  solved  by  EXOR-TDDs.  The  method  is  ap¬ 
plicable  to  functions  of  a  moderate  complexity. 
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Abstract 

In  this  paper ,  an  algorithm  for  generating  mod¬ 
ular  designs  of  Ordered  Multiple  Decision  Diagrams 
(OMDDs)  for  Current-Mode  CMOS  Logic  (CMCL) 
implementation  is  introduced .  The  OMDD  struc¬ 
tures  for  a  set  of  twelve  benchmark  circuits  from  the 
LGSynth93  using  radices  ranging  from  r  =2  to  r  =10 
are  generated  and  compared  in  terms  of  size  and  speed. 
It  is  observed  that  MODDs  with  radices  r  €  { 2 ,  4)  <?} 
result  in  the  smallest  area .  They  also  achieve  the  small¬ 
est  normalized  (with  respect  to  the  AT 2  measure  for  r 
=  2)  AT2 ,  where  A  is  the  area  and  T  is  the  delay . 


1  Introduction 

Multiple  Decision  Diagrams  (MDDs)  are  graph 
structures  which  are  becoming  the  state-of-the-art 
means  for  representation  of  both  binary  and  multiple¬ 
valued  logic  functions  [1][2].  A  number  of  research 
studies  on  MDDs  appeared  recently  in  the  literature. 
These  include  [3]  [4]  [5]  [6]  [7]  [8].  In  [3],  some  of  the  is¬ 
sues  concerning  implementation  of  an  MDD  package 
are  considered.  In  particular,  the  issue  of  adjacent 
level  interchange  to  allow  for  logical  operations  to  be 
performed  on  reduced  OMDDS  (ROMDDs)  has  been 
studied.  A  preliminary  package  has  been  implemented 
in  C.  In  [4]  new  techniques,  e.g.  inverted  edges  and  co- 
factoring  have  been  employed  in  the  synthesis  of  MVL 
MDDs.  The  integration  of  the  proposed  techniques  into 
CAD  environment  has  also  been  made.  In  [5]  a  de¬ 
sign  method  for  mutliple-output  networks  using  time 
divison  multiplexing  and  shared  multi-terminal  MVL 
MDDs  has  been  introduced.  It  is  shown  that  pairing 
of  input  variables  and  pairing  of  output  functions  can 


lead  to  a  substantial  reduction  in  the  number  of  gates 
needed  for  the  implementation  of  functions.  A  compre¬ 
hensive  coverage  of  Ternary  Decision  Diagrams  (TDDs) 
can  be  found  in  [6] . 

In  this  paper,  we  consider  ordered  MDDs  (OMDDs) 
for  representation  and  minimization  of  binary  func¬ 
tions  using  Current-mode  CMOS  (CMCL)  MVL  cir¬ 
cuits  [9]  [10]  [11].  We  have  generated  complete  layout 
for  OMDD  basic  cells  using  CMOS3DLM  design  rules. 
Several  benchmark  circuits  have  been  selected  from  the 
LGSynth93  [12]  and  their  OMDD  structures  are  gener¬ 
ated  for  varying  radices.  HSPICE  level3  modeling  and 
CMOS3DLM  design  rules  are  used  in  estimating  the 
area  and  the  delay  of  the  generated  OMDD  structures 
for  the  benchmark  circuits  considered  [9]. 

2  Background  Material 

We  consider  a  given  r- valued  function  f(X),  where 
X  =  {*i,  x2l  . . xn}  as  a  mapping  fi  Rn  -►  {0,  1}, 
where  R  =  {0,  1,  2,  . . .,  r-1}.  When  writing  a  prod¬ 
uct  term  of  a  function,  the  value  of  each  variable  is 
presented  superscripted.  For  example,  a  product  term 
in  a  4-variable  4-valued  function  can  be  written  as  p 
=  A2BlC°D 3.  The  meaning  of  this  representation  is 
that  the  product  term,  p,  =  lifA=2&B=l&C 
=  0  &  D  =  3.  The  same  product  term  can  be  written 
as  p( A,  B,  C,  D)  =  21034.  A  representation  of  a  an 
n-variable  r— valued  function,  /,  with  k  product  terms 
will  be  given  as  /  =  £(pi,p2,  •  •  •  ,Pk)-  For  example,  a 
4- variable  4- valued  function  will  be  represented  as  /  = 
£(21034,  23214,11104). 

A  MDD  is  acyclic  graph  (DAG)  which  forms  a  canon¬ 
ical  representation  of  a  given  function.  An  example  of 
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an  MDD  is  shown  in  Figure  1  for  the  function  /  = 
£(1013, 1103,  Ills,  112s)  =  A1B°C1  +  A1B1 .  In  order 


Figure  1.  Example  of  a  3-variable  3-valued 
MDD  representation  of  a  binary  function. 


to  make  MDDs  more  useful  structures,  some  restric¬ 
tions  can  be  imposed.  An  OMDD  is  an  MDD  with 
some  ordering  Xi  <  Xj  <  •  •  •  <  Xk  imposed  upon  the 
variables.  Figure  2  shows  the  effect  of  variable  ordering 
for  the  binary  function  F  =  AB  +  CD  +  EG.  In  this 
figure,  a  solid  line  indicates  a  ”0”  value  for  the  variable 
while  a  dashed  line  indicates  a  ”  1”  value  for  the  vari¬ 
able.  As  can  be  seen  from  the  figure,  the  number  of 
nodes  has  been  reduced  from  14  to  6  when  the  variable 
ordering  A,B,C,D,E,Gis  used  instead  of  the  order¬ 
ing  G,  C,  A,  E,  B,  D.  To  further  reduce  the  OMDDs 
in  size,  the  following  two  rules  can  be  imposed. 

1.  Remove  duplicate  nodes  ,  and 

2.  Remove  redundant  nodes  (a  redudant  node  has  all 
its  output  paths  leading  to  the  same  node). 


Figure  2.  The  effect  of  variable  ordering  on 
the  size  of  a  MDD. 


Fig.  3  illustrates  the  effect  of  applying  the  above  rules 
on  the  size  of  the  MDD  shown  in  Fig.  1.  Fig.  4 
illustrates  the  basic  CMCL  MVL  circuits  used.  These 
circuits  are  respectively  the  current  generator,  current 
mirror,  threshold  detector,  and  the  switch.  Detailed 
operation  of  these  cells  can  be  found  in  [10]. 


Figure  3.  The  effect  of  removing  duplicate 
nodes  on  the  size  of  an  OMMS. 


3  OMDDs  Implementation 

The  OMDD  consists  of  a  number  of  identical  nodes 
(cells).  A  schematic  of  a  general  node  is  shown  in  Fig. 
5.  The  truth  table  of  the  cell  is  shown  in  Table  1. 
The  CMCL  MVL  implementation  of  a  node  for  r— 4  is 
shown  in  Fig.  6.  The  cell  has  one  input  and  3  ( r-1 ) 
outputs.  It  should  be  noted  that  the  threshold  element, 
Th(0))  is  not  connected  to  the  P-  mirror.  This  is  be¬ 
cause  the  value  of  the  input  current  is  assumed  to 
be  Iin  >  0.  This  necessitates  the  modification  of  the 
threshold  element  shown  in  Fig.  4  to  become  as  shown 
in  Fig.  7.  This  has  been  done  in  order  to  guarantee 
that  when  V{n  =  0,  then  the  output  of  Th(0)  will  be 
0  due  to  pre-discharging  of  the  node  y  using  the  sig¬ 
nal  CLK.  The  cell  operates  as  follows.  When  Vin  is 
binary  high  and  according  to  the  value  of  the  input  cur¬ 
rent  Iin ,  the  output  of  one  of  the  threshold  elements 
will  be  binary  high  and  the  outputs  of  the  remaining 
threshold  elements  will  be  binary  low.  This  binary  high 
value  will  propagate  through  the  transmission  gates  to 
the  inverter  input  where  it  will  be  inverted.  The  in¬ 
verted  value  will  then  enable  the  corresponding  output 
p  transistor,  thus  producing  a  pure  logic  one  at  the  cor¬ 
responding  output  O*.  It  should  be  noted  that  the 
output  of  the  node  are  voltage  levels.  This  allows  us 
to  derive  multiple  subsequent  (multiple  fan-out)  nodes, 
thus  allowing  the  implementation  of  multiple  functions 
with  a  single  OMDD. 

4  OMDD  Generation 

In  this  section,  we  introduce  the  algorithm  used  to 
generate  the  OMDD  for  a  given  function  using  the  gen- 
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Figure  6.  CMCL  MVL  implementation  of  the 
node  in  an  OMDD  for  r  =  4. 


Vin  Clk 


y 


Figure  7.  The  modified  threshold  element 
used  for  the  Th(0)  in  Fig.  4. 


eral  node  structure  shown  in  Fig.  5  as  the  basic  cell. 


Figure  4.  CMCL  MVL  basic  cells. 


The  Algorithm 


lin 


Figure  5.  Schematic  of  a  cell  (node)  in  an 
OMDD. 


Table  1.  Truth  table  of  the  OMDD  basic  cell. 


In  the  following  algorithm,  the  variables  used  have 
the  following  meanings. 

num  Vars :  the  number  of  variables  in  the  function 
numF :  the  number  of  functions 

node :  the  number  assigned  to  the  node  currently  gen¬ 
erated 

max :  the  maximum  assigned  for  nodes  generated  so  far 

Lij:  the  set  of  minterms  of  function  j  that  may  still 

be  satisfied  at  node  i 

leveli :  the  level  number  of  node  i 

(val,  num):  returns  the  node  number  reached  from 

node  val  along  path  num 

1.  node  •—  3 

3.  levelg  «—  1 

3.  Lq  * —  aet  of  nil  minterms  of  F 

4.  mix  —  3 

5.  for  i  •—  1  to  r-1 

i)  Let  Lmax^>1  be  the  list  of  ill  minterms  of  Lno(|e  with  the  levelnoe(c  th  variable  =  i. 

b)  if  |Lmar+1 1  ss  0  then  (node.i)  —  0. 

c)  if  |Lm„+1|  =  r««mVar.+  l-leve?noe,e  then  („ode,i)  4-  1 

d)  if  Lno<je  is  equal  to  Lj  (where  j  <  node),  then  (node,i)  **•  (j,i). 

e)  if  (aode,i)  hasn’t  been  assigned  yet,  then  max  «—  max  +  1 

(node,i)  «—  max 

8.  if  mix  >  node  then  node  ♦—  node  +  1  and  goto  *—  l«v«lnode  +  1  ■ 

T.  for  node  *—  mix  to  2 

if  (node,i)  =  (node.O)  V  i<r-I  then  for  j  *—  2  to  node-1 
if  (j,k)  s=  node  then  (j,k)  =  (node.O),  V  k  <  r-1 

The  algorithm  used  for  a  given  variable  ordering  it¬ 
eratively  cycles  through  each  node  in  the  OMDD  and 
for  each  path  steming  from  that  node,  the  algorithm  de¬ 
termines  the  subsequent  node  to  which  this  path  leads. 
A  value  of  1(0)  is  assigned  in  order  to  show  that  the 
value  of  the  function  is  1(0)  if  such  path  is  followed. 
The  subsequent  node  to  which  a  path  should  lead  is 
determined  as  follows.  For  each  path,  a  list  of  possible 
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minterms  which  can  still  be  covered  by  following  that 
path  is  generated.  Three  possibilities  exist.  First,  if 
the  list  is  empty,  then  the  path  leads  to  the  value  0 
for  the  function.  Second,  if  the  path  covers  all  possi¬ 
ble  remaining  mintermins  for  the  given  function,  then 
that  path  leads  to  the  value  1  for  the  function.  The 
third  possibility  is  when  neither  of  the  above  two  cases 
is  true.  In  this  cases,  the  path  is  assumed  to  lead  to 
an  intermediate  node  in  the  OMDD  and  a  new  node  is 
generated  unless  an  identical  node  exists  in  the  OMDD 
(no  duplication  nodes  are  allowed).  If  such  node  exists, 
then  the  algorithm  assigns  that  node  to  the  path.  The 
algorithm  continues  in  this  fashion  until  leaf  nodes  are 
reached.  At  that  time,  the  algorithm  cycles  backwards 
through  all  nodes  trying  to  identify  redundant  nodes 
and  remove  them.  The  algorithm  used  is  shown  below. 


4.1  An  Example 

Consider  a  3-variable  3-valued  function  given  by 
F  =  A°BXC 2  +  A1B°C1  +  A1BlC1  +  AlBlC 2. 
=  ]TX0123,  1013,1113,  1123).  For  this  func¬ 

tion,  the  variable  ordering  A,  C,  B  is  assumed. 


Output 

Level 

Set  of  Minterms 

2  3  4  0 

1 

(012,  101,  111,  112) 

3  0  0  5 

2 

(-12) 

4  0  6  5 

2 

(-01,-11,-12) 

5  0  10 

3 

(-1-) 

6  110 

3 

(-0-,  -1-) 

The  program  starts  with  node  2.  The  reason  for  this 
is  that  the  numbers  1  and  0  are  reserved  for  terminal 
nodes.  Using  L2  =  (012,  101,  111,  112),  we  determine 
the  output  paths  of  this  node.  With  the  restriction  A  = 
0,  we  get  L3  =  (-12).  The  *-*  is  just  used  here  to  signify 
that  the  variable  A  has  already  been  covered.  Next, 
we  use  the  restriction  A  =  1.  This  gives  us  L4  =  (-01, 
-11,  -12).  After  this,  we  try  A  =  2.  Since  this  would 
give  us  an  empty  list,  this  value  would  be  0.  Thus,  we 
get  (1,0)  =  3,  (1,1)  =  2,  and  (1,2)  =  0. 

NOTE  :  The  addition  of  the  dashes  here,  although  not 
mentioned  in  the  above  algorithm,  is  necessary.  The 
dashes  show  how  two  minterms  can  share  a  node.  For 
example,  take  the  minterms  012  and  112.  After  the  A 
restriction,  they  become  -12  and  -12.  Thus,  they  now 
appear  identical  and  can  be  both  covered  by  the  same 
nodes  in  the  OMDD, 

5  Experimental  Results 

We  have  applied  the  above  algorithm  to  twelve 
benchmark  circuits  from  the  LGSynth93  [12].  In  each 


of  these  circuits,  the  OMDD  strcutures  has  been  gen¬ 
erated  for  radices  ranging  from  r  =2  to  r  =  10.  The 
OMDD  delay  and  physical  dimension  of  the  OMDDs 
are  estimated.  The  delay  has  been  obtained  using 
HSPICE  CMOS3DLM  parameters,  while  the  area  mea¬ 
surements  were  made  using  the  1.5  micron,  two  metal 
process  in  Cadence .  More  elaboration  on  these  two 
aspects  is  given  below. 


5.1  Delay  Characteristics 


Fig.  8  shows  the  OMDD  structure  generated  for  the 
”9symml”  benchmark  circuit  for  the  case  r  =  2.  There 
are  33  nodes  in  total  in  that  OMDD.  Solid  lines  in  this 
figure  indicate  paths  to  ”0”  value  while  dashed  lines 
represent  paths  to  ”1”  value.  We  have  estimated  the 
delay  for  that  OMDD  as  ToJWfJ9D  “  A variables  ^  'Bnodei 
where  Nvariab\es  represents  the  number  of  variables  in 
the  function  and  Tnode  represents  the  time  delay  of  a 
single  node.  Table  2  shows  the  average  delay  of  a  node 
for  each  of  the  nine  radices  r  =2  to  r  =  10.  These  node 
delay  values  are  obtained  using  HSPICE. 


Figure  8.  The  OMDD  structure  for  the 
"9symmr  benchmark  circuit  for  r  =2. 
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Table  2.  The  average  node  delay  in  "9symmr 
for  different  radices 


Radix 

Average  delay  (ns) 

2 

5.1 

3 

8.4 

4 

11.6 

5 

15.5 

6 

19.7 

7 

23.8 

8 

28.8 

9 

34.0 

10 

39.9 

5.2  Area  Estimation 

We  have  enumerated  the  number  of  nodes  in  the 
OMDD  of  twelve  benchmark  circuits  for  radices  rang¬ 
ing  from  r  =  2  to  r  =  10.  Table  4  lists  the  the  bench¬ 
mark  circuits  and  the  number  of  nodes  in  their  OMDD 
for  the  radices  considered. 


Table  3.  The  number  of  nodes  in  the  OMDD  of 
a  number  of  benchmark  circuits. 


Circuit 

r=2 

r=3 

r— 4 

r=5  . 

r=6 

r=7 

r— 9 

r— 10 

9symml 

33 

62 

17 

40 

37 

33 

11 

28 

26 

C17 

12 

15 

8 

9 

6 

6 

5 

5 

5 

Z5xpl 

127 

66 

43 

35 

27 

23 

19 

18 

16 

a  2 

11 

8 

5 

5 

4 

4 

3 

3 

3 

apex4 

438 

233 

153 

117 

95 

80 

66 

59 

54 

coni 

28 

43 

14 

32 

23 

23 

9 

17 

16 

del 

11 

7 

4 

3 

3 

3 

3 

3 

1 

4s  m 

16 

16 

9 

10 

7 

6 

5 

5 

5 

majority 

9 

11 

5 

9 

6 

8 

4 

5 

5 

mlp4 

243 

127 

80 

64 

52 

42 

35 

33 

29 

rd84 

36 

79 

16 

56 

35 

37 

10 

33 

29 

xor5 

9 

11 

5 

9 

5 

6 

3 

5 

5 

From  table  4,  one  can  see  that  the  number  of  nodes 
in  the  realization  of  a  given  OMDD  is  minimal  for  r 
=  2,  4,  and  8.  In  order  to  estimate  the  actual  silicon 
area  consumed  by  a  given  OMDD,  we  have  investigated 
three  ways  to  estimate  such  area.  These  are: 

1.  Taking  interconnect  into  account:  Here  the  height 
of  the  OMDD  at  its  highest  point  is  multiplied 
by  the  length  of  the  OMDD.  Added  to  this  is  the 
interconnect  area. 

2.  Only  node  area  is  considered:  Here  the  structure 
of  the  OMDD  is  ignored.  We  just  mulitply  the 
number  of  nodes  times  the  area  consumed  by  the 
node  for  a  given  radix  (see  table  4). 

3.  Taking  the  area  of  nodes  and  the  interconnect 
among  nodes  into  account:  Here  we  take  the  area 
consumed  by  interconnects  among  nodes  into  ac¬ 
count. 


Fig.  9  illustrates  the  difference  between  the  area  mea- 
surments  as  given  by  points  2  and  3  above. 


Figure  9.  Two  different  area  measurements. 


Table  4.  The  area  of  the  OMDD  cells  for  differ¬ 
ent  radices. 


Radix 

Cell  Area 

2 

3480 

3 

5371 

4 

8215 

5 

11765 

6 

15580 

7 

20240 

8 

23900 

9 

31360 

10 

37820 

We  have  used  a  combined  measure  in  order  to  assess 
the  merits  of  the  generated  OMDDs.  This  measure  is 
the  AT2  figure  of  merits,  where  A  is  the  area  and  T 
is  the  delay.  In  order  to  compute  such  figure  of  mer¬ 
its,  we  computed  the  product  of  the  number  of  nodes 
in  the  OMDD  (as  shown  in  Table  3)  times  the  area  of 
one  node  (as  shown  in  Table  4)  times  the  square  of  the 
time  delay  of  each  node  (as  shown  in  Table  2).  We  then 
normalized  such  factor  with  respect  to  the  AT2  for  the 
case  r  =  2.  The  resultant  normalized  figures  are  shown 
in  Table  5.  It  is  clear  from  table  5  that  OMDDs  with 
radices  r— 2,  4,  and  8  possess  the  best  figure  of  merits 
measured  by  the  factor  AT 2. 

6  Conclusion 

In  this  paper,  we  have  considered  the  issues  related 
to  synthesis  of  OMDDs  for  current-mode  CMOS  MVL 
implementation.  The  basic  cell  of  the  OMDD  has  been 
designed  and  its  performance  has  been  evaluated  in 
terms  of  the  chip  area  and  speed  at  different  radices 
ranging  from  r= 2  to  r  =  10.  An  algorithm  has  been 
proposed  for  the  generation  of  OMDDs  using  the  design 
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Radix 

Normalized  AT2 

2 

i 

3 

8 

4 

6 

5 

38 

6 

75 

7 

127 

8 

73 

9 

340 

10 

524 

Table  5.  The  AT2  measures  for  the  M9symmr 
benchmark  circuit  for  different  radices 


cell.  The  OMDDs  for  a  number  of  benchmark  circuits 
from  the  LGSynth93  have  been  generated  and  their 
performance  has  been  compared  in  terms  of  the  chipe 
area,  the  delay,  and  the  AT2  figure  of  merits.  It  has 
been  shown  that  radices  r  =2,  4,  and  8  lead  to  OMDDs 
with  the  smallest  area  and  the  best  AT 2 .  An  important 
contribution  of  this  work  is  laying  the  foundation  for  a 
CAD  tool  for  synthesis  of  OMDDS  using  CMCL  MVL 
basic  cells. 
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Abstract 

In  this  paper,  we  propose  a  method  to  represent 
multiple-output  functions  using  shared  multiple¬ 
valued  decision  diagrams  (SMDDs).  We  show  an 
algorithm  for  pairing  the  input  variables  of  binary 
decision  diagrams  (BDDs).  We  also  present  the  pair 
sifting  that  moves  pairs  of  4-valued  input  variables 
to  speed  up  the  normal  sifting,  and  to  produce  com¬ 
pact  SMDDs.  The  size  of  the  SMDD  is  the  total 
number  of  non-terminal  nodes  excluding  the  nodes 
for  output  selection  variables.  We  derive  the  sizes  of 
SMDDs  for  general  functions  and  symmetric  func¬ 
tions.  From  experimental  results,  we  conjecture 
that,  for  n  >  1,  the  sizes  of  SMDDs  for  bit-counting 
functions  (wgt  n)  and  incrementing  functions  (inc  n) 
are  n|k>g2  nj  +  n  —  2*-log2  nJ  and  2n  —  1,  respectively, 
where  n  is  the  number  of  binary  input  variables,  and 
|aj  denotes  the  largest  integer  not  greater  than  a. 
We  also  compare  our  method  with  other  one.  Exper¬ 
imental  results  show  the  efficiency  of  our  method. 

1  Introduction 

Multiple- valued  decision  diagrams  (MDDs)  are  exten¬ 
sions  of  binary  decision  diagrams  (BDDs),  and  are  use¬ 
ful  in  logic  synthesis,  time-division  multiplexing  (TDM) 
realizations,  logic  simulation,  FPGA  design,  etc  [3,  5, 
8,  9,  11,  12,  13,  14].  MDDs  are  usually  smaller  than 
the  corresponding  BDDs,  and  require  fewer  memory 
access  to  evaluate  them  [2,  15].  A  shared  multiple¬ 
valued  decision  diagram  (SMDD)  is  a  set  of  MDDs  that 
compactly  represents  a  multiple-output  function  [2,  6]. 
SMDDs  can  be  used  in  many  applications  such  as  design 
of  multiplexer-based  networks,  design  of  pass-transistor 
logic  networks,  etc  [9,  13,  14].  For  example,  Fig.  1.2 
shows  the  multiplexer- based  network  corresponding  to 
the  SMDD  in  Fig.  1.1.  In  these  applications,  the  re¬ 
duction  of  the  number  of  nodes  in  the  SMDDs  is  im¬ 
portant.  In  [2],  Sasao  and  Butler  used  shared  binary 
decision  diagrams  (SBDDs)  to  find  good  pairs  of  the  in¬ 
put  variables.  In  [6],  Miller  and  Drechsler  used  MDDs 


to  represent  functions  with  4-valued  outputs,  and  min¬ 
imized  them  by  cycle  negations.  This  paper  considers 
the  following  methods  to  construct  compact  SMDDs: 

1.  Pair  the  binary  input  variables  to  make  multiple¬ 
valued  variables. 

2.  Order  the  multiple- valued  variables  in  the  SMDDs. 

We  will  introduce  a  parameter  to  find  good  pairs  of  the 
input  variables.  The  parameter  of  an  input  variable  de¬ 
notes  the  influence  of  the  variable  on  the  size  of  the 
BDD.  We  also  present  an  extension  to  the  sifting  algo¬ 
rithm  that  moves  pairs  of  4-valued  input  variables  to 
speed  up  the  sifting,  and  to  produce  compact  SMDDs. 
Furthermore,  from  experimental  results,  we  derive  for¬ 
mulas  for  the  sizes  of  SMDDs  for  bit-counting  functions 
(wgt  n)  and  incrementing  functions  (inc  n).  The  rest 
of  the  paper  is  organized  as  follows:  Section  2  defines 
various  decision  diagrams,  and  shows  the  properties  of 
SMDDs.  Section  3  presents  an  algorithm  for  pairing 
the  input  variables,  and  shows  optimization  methods 
for  SMDDs.  Section  4  presents  experimental  results  for 
various  benchmark  functions.  It  shows  the  efficiency  of 
our  method,  and  compares  the  sizes  of  SMDDs  obtained 
by  two  different  methods. 

2  Decision  Diagrams 

In  this  section,  we  define  various  decision  diagrams,  and 
present  properties  of  shared  multiple- valued  decision  di¬ 
agrams  (SMDDs).  The  proofs  of  lemmas  and  theorems 
are  omitted  due  to  the  space  limitation. 

Definition  2.1:  Let  F  =  (/o,/i, . . .  ,/m-i).  The  size 
of  a  decision  diagram  (DD)  for  a  function  F, 
denoted  by  size{DD ,  F),  is  the  total  number  of  non¬ 
terminal  nodes  excluding  the  nodes  for  output  selection 
variables . 

Example  2.1:  The  size  of  the  SMDD  in  Fig .  1.1  is 
7.  Note  that  g$  is  the  output  selection  variable  in  the 
SMDD .  ■ 
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valued  complete  decision  tree  using  the  following  reduc¬ 
tion  rules: 

•  Two  nodes  are  merged  into  one  node  if  they  repre¬ 
sent  the  same  function,  and 

•  A  node  v  is  deleted  if  all  the  children  of  v  represent 
the  same  function. 

From  now,  we  will  simply  call  an  ROMDD  as  an  MDD. 

2.2.1  Shared  Multiple- Valued  Decision 
Diagrams 


Figure  1.1:  SMDD. 


X2 


Figure  1.2:  A  multiplexer- based  network  corre¬ 
sponding  to  the  SMDD  in  Fig.  1.1. 

2.1  Binary  Decision  Diagrams 

Binary  decision  diagrams  (BDDs)  are  efficient  represen¬ 
tations  of  logic  functions  [1].  A  shared  binary  decision 
diagram  (SBDD)  is  a  set  of  BDDs  combined  by  a  tree  for 
output  selection  [4,  7],  and  represents  a  multiple- output 
function. 

2.2  Multiple- Valued  Decision  Diagrams 

Let  /  :  {0,1,..., r  -  1}N  — ►  {0,1}.  A  multiple¬ 
valued  decision  diagram  (MDD)  of  an  r- valued  N 
input  variables  function  f(X\y  X2, . . . ,  Xn)  is  a  di¬ 
rected  graph  with  a  root  node  that  has  r  outgoing 
edges  labeled  0, 1, . . . ,  r  —  1  directed  to  nodes  repre¬ 
senting  /( 0,X2,...,Ajv),  /( l,X2,...,Ajv),  and  f(r  - 
l,X2,...,X;v),  respectively.  For  each  of  these  nodes, 
there  are  r  outgoing  edges  which  go  to  nodes  that  have 
r  outgoing  edges,  etc.  A  terminal  node  is  a  node  that 
has  no  outgoing  edges.  It  is  labeled  by  0  or  1  which  cor¬ 
responds  to  a  binary  value  of  the  function  /.  A  reduced 
ordered  MDD  (ROMDD)  is  derived  from  a  multiple- 


Shared  multiple- valued  decision  diagrams  (SMDDs)  rep¬ 
resent  multiple-valued  multiple-output  functions.  An 
SMDD  is  a  set  of  MDDs  combined  by  a  tree  for  out¬ 
put  selection.  Fig.  1.1  shows  an  example  of  an  SMDD, 
where  go  is  the  output  selection  variables  for  functions 
/o  and  /1.  An  SMDD  has  the  following  properties: 

1.  It  easily  checks  the  equivalence  of  two  functions. 

2.  It  shares  isomorphic  sub-graphs  of  MDDs,  and  rep¬ 
resents  a  multiple-output  function  compactly. 


3.  Its  logic  levels  are  smaller  than  BDDs. 


Theorem  2.1:  Let  R  =  {0, 1, . . .  ,r  -  1}  and  B  = 
{0,1}.  Then ,  the  size  of  the  SMDD  for  an  N -input 

TV 

m-output  function  RN  — ►  Bm  is  at  most  min  {m- 

2rk  -  2}. 

Corollary  2.1:  An  arbitrary  N -input  m-output  func¬ 
tion  Rn  Bm  can  be  represented  by  an  SMDD  with 
nodes. 

Lemma  2.1:  Let  R  =  {0, 1, . . .  ,r  —  1}  and  B  =  {0, 1}. 

Then ,  all  the  non-constant  symmetric  functions  RN  — ► 

N 

B  can  be  represented  by  MDDs  with  ^[2(  +i  )  —  2] 

i=  1 


non-terminal  nodes . 


Theorem  2.2:  Let  R  =  {0, l,...,r  —  1}  and  B  = 
{0, 1}.  Then ,  the  size  of  the  SMDD  for  an  N -input  m- 
output  symmetric  function  RN  — ►  Bm  is  at  most 


N 

min  { m  • 

k=i 


f i  +  r  -  ! 


From  here,  we  assume  that  r  =  4. 

Definition  2.2:  Let  wgt  n  be  an  n-input  (|_l°g2nJ  + 
1  )-output  function  that  counts  the  number  of  Vs  in  the 
inputs ,  and  represents  it  by  a  binary  number ,  where  n 
is  the  number  of  binary  input  variables ,  and  [aj  denotes 
the  largest  integer  not  greater  than  a.  It  represents  a 
bit-counting  function . 
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do 


Figure  2.1:  SMDD  for  Figure  2.2:  SMDD  for 
functions  (/o,/i)  =  functions  (/o,/ij/2)  = 

(Xl,£2).  ^3)- 

Definition  2.3:  Let  inc  n  be  an  n-input  (n+\)-output 
function  that  computes  x  +  l,  where  n  is  the  number  of 
binary  input  variables.  It  represents  an  incrementing 
function. 

From  the  experimental  results,  we  have  the  following 
conjectures  for  the  bounds  on  the  sizes  of  SMDDs  for 
wgt  n  and  inc  n: 

Conjecture  2.1:  size(SMDD,wgt  n )  =  n[log2nJ  4- 
n  —  2L1o82  ”J }  where  n  >  1. 

Conjecture  2.2:  size(SMDD,inc  n)  —  2n  —  1,  where 
n  >  1. 

Theorem  2.3:  Let  an  SMDD  represent  m  functions 
fi  =  {i  =  0, 1, . . . ,  m  —  1),  where  Xi  is  a  binary  input 
variable.  Then ,  the  size  of  the  SMDD  is  m. 

Example  2.2:  Figs.  2.1  and  2.2  show  the  SMDDs 
for  functions  (/o,/i)  =  (si,x2)  and  (/o,/i,/2)  = 
(x!,x2,X3),  respectively.  Their  sizes  are  2  and  3 ,  re¬ 
spectively.  ■ 

Note  that  the  size  of  the  SBDD  to  represent  m  functions 
/,•  =  xi  (i  =  0, 1, . . . ,  m  -  1)  is  also  m  [10]. 


Figure  3.1:  BDD  for  Figure  3.2:  BDD  for 
the  function  /o.  the  function  f\. 

the  size  of  the  decision  diagram  are  called  influential. 
We  will  use  a  heuristic  algorithm  to  pair  the  influential 
input  variables  of  BDDs  for  multiple-output  functions. 

3.1.1  Our  Method 

In  this  part,  we  will  show  a  heuristic  algorithm  to  find 
good  pairs  of  the  input  variables. 

Definition  3.1:  Let  U  =  {«i ,  W2,  •  •  • ,  Uk}  be  a  set  of  k 
variables.  Let  U\  C  U,  and  U2  QU.  P  —  {U\,  U2}  is  a 
partition  of  U  iffU\\JU2~U,  and  Uif]U2  =  <j>- 

Example  3.1:  Let  U  =  {mi,U2>M3>M4}  be  a  &et  of  four 
variables.  Then ,  {[«l5  u2\,  [^3,  *4]}  is  a  partition  of  U . 

m 

Definition  3.2:  Let  a  BDD  represent  a  function  f . 
para(xi)  is  the  parameter  of  an  input  variable  X{  at 
height  i  in  the  BDD  with  the  variable  ordering. 
It  denotes  the  level  of  the  BDD  for  Xj.  We  assume  that 
the  smaller  the  value  of  para(x{),  the  more  influential 
the  variable  x,-. 


3  Construction  of  Compact 
SMDDs 

Compact  SMDDs  are  important  to  represent  multiple- 
output  functions  efficiently.  In  this  section,  we  will  show 
heuristic  algorithms  to  optimize  SMDDs.  We  consider 
the  following  approaches  to  reduce  the  sizes  of  SMDDs: 
1)  pairing  of  input  variables;  2)  ordering  of  input  vari¬ 
ables  by  sifting  the  4- valued  variables  [16];  and  3)  order¬ 
ing  of  input  variables  by  sifting  pairs  of  4- valued  vari¬ 
ables. 

3.1  Pairing  of  Input  Variables 

Pairing  of  input  variables  is  important  to  reduce  the 
size  of  SMDDs.  The  input  variables  that  greatly  affect 


Example  3.2:  Figs.  3.1  and  3.2  show  the  BDDs  for 
the  functions  fo  and  f\,  respectively.  The  values  of 
para(xi)  for  both  BDDs  are  shoum  in  the  figures.  For 
example,  X\  is  the  most  influential  variable  in  the  BDD 
in  Fig.  3.1,  since  para(x  1)  is  the  smallest  among  all 
para(xi).  ■ 

Definition  3.3:  Let  F  =  (/o, /i, •  •  • , /m-i)  be  an  n- 
input  m-output  function,  and  parakfai)  be  the  param¬ 
eter  of  the  variable  x,*  in  the  BDD  for  /*.  Then, 
T  =  (Ti,T2>. ..,!■„)*  is  a  total  parameter  vector, 
where  T;  is  calculated  as  follows: 

m  —  1 

Ti  =  para(xi)  =  para*(xt). 

fc=o 
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Figure  3.3:  SMDD  for  the  partition 


Figure  3.4:  SMDD  for  the  partition 


Figure  3.5:  SMDD  for  the  partition 
{[xi,x4],  [x2,x3]}. 


Example  3.3:  Consider  the  functions  in  Example  3.2. 
The  total  parameter  vector  for  F  =  (/o,/i)  ts 


para(xi) 
T  _  para(x 2) 
para(x 3) 
para(x  4) 


Definition  3.4:  The  weight  w(i,j)  for  a  pair  of  input 
variables  X{  and  xj  (i  ^  j)  is  defined  by 
w(i,j)  =para(xi)  ■ para(xj ). 

Example  3.4:  In  the  functions  of  Example  3.2,  the 
weights  are  as  follows:  w(  1,2)  =  12,  u;(l,3)  =  6, 
u/(l,4)  =  32,  w(2, 3)  —  18,  w(2,4)  =  96,  and  w( 3,4)  = 
48.  ■ 


Algorithm  3.1:  (Pairing  the  input  variables) 

Let  F  :  {0,  l}n  — >  {0,  l}m.  Let  Q  be  a  set  of  pairs  of  n 
input  variables,  and  q  E  Q.  Let  W  be  the  list  of  sorted 
weights  for  the  pairs  of  Q. 

1.  Optimize  the  BDD  for  each  output  function  f. 

2.  Calculate  the  total  parameter  vector  T. 

3.  Calculate  the  weight  w(i,j)  for  each  pair  of  input 
variables. 

4.  Make  the  list  of  weights,  W  sorted  in  ascending  or¬ 
der. 

5.  Select  q  €  Q  with  the  smallest  weight  w(i,j)  from 
W,  and  eliminate  the  pairs  that  contain  the  input 
variables  in  q  from  Q.  Update  Q  and  W . 

6.  Repeat  Step  5  until  Q  ^  <j>,  and  make  a  good  parti¬ 
tion  of  input  variables  with  the  selected  pairs. 

Example  3.5:  Consider  the  functions  in  Example  3.2. 
There  are  three  different  ways  of  pairing  four  inputs: 

1) {[*i,ar2]j  [23,^4]}  (SMDD  in  Fig.  3.4), 

2) {[®1j®4]i  [^2,^3]}  (SMDD  in  Fig.  3.5),  and 

3) {[zi,£3],  [x2,X4]}  (SMDD  in  Fig.  3.3). 

3)  is  a  good  partition  of  the  input  variables  according  to 
Algorithm  3.1,  since  w(  1,3)  is  the  smallest  element.  ■ 

3.2  Ordering  of  Input  Variables 

The  ordering  of  input  variables  is  very  important  to  re¬ 
duce  the  sizes  of  SMDDs  [2,  16,  17,  18,  19].  Sifting  is 
an  efficient  method  to  find  a  good  ordering  of  the  input 
variables.  Normal  sifting  [16]  moves  a  single  variable 
at  a  time,  while  group  siftings  [17,  18]  move  more  than 
one  variable  at  a  time.  Group  siftings  are  faster  than 
normal  siftings  to  produce  compact  DDs.  Pair  sifting 
is  one  kind  of  group  siftings  that  moves  a  pair  of  sym¬ 
metric  variables  at  a  time.  In  this  part,  we  will  consider 
the  normal  sifting  and  the  pair  sifting  of  4-valued  in¬ 
put  variables.  In  the  case  of  the  pair  sifting,  we  find 
good  pairs  of  the  4- valued  input  variables  from  MDDs. 
Functions  are  usually  multiple  outputs,  and  it  is  not  so 
easy  to  find  good  pairs  of  the  input  variables  for  all  the 
outputs.  We  use  Algorithm  3.1  to  find  good  pairs  of  the 
input  variables. 

Algorithm  3.2:  (Construction  of  an  SMDD  using  the 
normal  sifting) 

1.  Construct  the  SMDD  by  Algorithm  3.1. 

2.  Use  the  sifting  algorithm  [16]  for  the  4’Valued  input 
variables  such  that  the  size  of  the  SMDD  is  mini¬ 
mized. 


169 


From  now,  we  will  call  this  optimized  SMDD  as  the 
SMDD  with  the  normal  sifting. 

Algorithm  3.3:  (Construction  of  an  SMDD  using  the 
pair  sifting  of  4-valued  variables) 

Let  F  :  {0, 1}”  — >  {0,  l}m. 

1 .  Construct  the  MDDs  for  F  by  Algorithm  3.1. 

2.  Find  good  pairs  of  4-valued  variables  from  the 
MDDs  using  the  similar  techniques  to  Algorithm 
3.1,  and  make  an  initial  variable  ordering  with  the 
pairs. 

3.  Select  a  pair  of  4 -valued  variables  from  the  initial 
ordering,  and  use  the  sifting  algorithm  to  find  the 
position  of  the  pair  such  that  the  size  of  the  initial 
SMDD  is  minimized. 

4.  Repeat  Step  3  until  all  the  pairs  from  the  initial 
ordering  have  been  checked,  and  choose  the  smallest 
SMDD. 

Example  3.6:  Let  {[Xi,X3],  [X2,  X4]}  be  a  good  par¬ 
tition  of  4-valued  input  variables  obtained  by  Algorithm 
3.1 .  Then,  the  initial  variable  ordering  for  the  pairs  is 
(XUX3,X2,X4).  m 

From  now,  we  will  call  this  optimized  SMDD  as  the 
SMDD  with  the  pair  sifting. 

4  Experimental  Results 

We  implemented  C  programs  to  construct  SBDDs  and 
SMDDs.  The  benchmark  functions  were  generated  in 
two  ways:  1)  functions  with  2- valued  inputs  for  SBDDs; 
and  2)  functions  with  4- valued  inputs  for  MDDs  and 
SMDDs  generated  from  the  2- valued  ones.  The  encod¬ 
ings  to  the  4-valued  inputs  were  done  by  using  Algo¬ 
rithm  3.1.  In  the  case  of  odd  number  of  inputs,  one 
input  variable  was  remained  in  the  2-valued  form  for 
MDDs  and  SMDDs.  We  used  the  sifting  algorithm  [16] 
for  input  variables  to  reduce  the  sizes  of  SBDDs  and 
SMDDs.  In  addition,  we  considered  an  extension  to  the 
sifting  that  moved  pairs  of  4-valued  input  variables  to 
produce  compact  SMDDs.  The  symbol  “f”  in  Tables  4.1 
and  4.2  denotes  the  function  with  don’t  cares ,  where 
the  don’t  cares  were  set  to  zero  during  the  experiment. 
Table  4.1  compares  the  sizes  of  SBDDs  and  SMDDs  for 
some  benchmark  functions.  Note  that  the  nodes  for  out¬ 
put  selection  variable  are  not  included  in  the  size.  We 
used  Algorithms  3.1,  3.2,  and  3.3  to  optimize  SMDDs, 
and  chose  the  smaller  ones  between  the  SMDDs  with 
the  normal  sifting  and  the  SMDDs  with  the  pair  sifting. 
The  results  are  shown  in  the  column  headed  with  our 
method.  Table  4.1  showed  that,  for  many  benchmark 
functions, 

size(SMDD ,  F)  <  \  size(SBDD,  F). 


To  show  the  effectiveness  of  Algorithm  3.1,  we  also  com¬ 
pared  our  results  with  the  sizes  of  SMDDs  for  the  best, 
average,  and  worst  cases.  For  n  =  2t  2-valued  input 
variables,  there  exist  Q  =^2^  different  ways  of  pair¬ 
ing  the  inputs.  For  example,  when  n  =  12,  we  have 
Q  =  10395  different  pairings.  We  derived  all  possible 
SMDDs  for  different  pairings  to  find  the  smallest  one. 
The  values  in  the  column  headed  with  best  were  gen¬ 
erated  in  this  way.  We  used  the  sifting  algorithm  [16] 
to  order  the  input  variables.  Also,  the  columns  headed 
with  average  and  worst  were  obtained  at  the  same 
time.  Table  4.1  shows  that,  in  many  cases,  our  algo¬ 
rithms  obtained  SMDDs  that  are  close  to  the  best  ones. 

Table  4.2  compares  the  sizes  of  SMDDs  obtained  by 
two  different  methods.  The  column  headed  with  our 
method  was  obtained  in  the  same  way  as  the  column 
our  method  in  Table  4.1.  This  table  shows  that  our 
method  often  produces  more  compact  SMDDs  than  [2]. 

Table  4.3  compares  the  sizes  of  MDDs  and  SMDDs  for 
larger  benchmark  functions.  In  the  MDDs  of  a  multiple- 
output  function,  each  MDD  was  optimized  indepen¬ 
dently.  So,  MDDs  may  have  different  partitionings  of 
the  2-valued  inputs,  but  have  the  same  ordering  of  the 
2- valued  input  variables.  However,  nodes  in  the  different 
MDDs  are  not  shared  even  if  they  represent  the  same 
function.  The  variable  ordering  for  the  MDDs  was  ob¬ 
tained  from  the  corresponding  optimized  SMDD.  The 
column  headed  with  sift  denotes  the  sizes  of  SMDDs 
which  were  obtained  by  using  the  sifting  of  4- valued  in¬ 
put  variables  (Algorithm  3.2).  On  the  other  hand,  the 
column  headed  with  psift  denotes  the  sizes  of  SMDDs 
which  were  obtained  by  using  the  sifting  of  pairs  of  4- 
valued  input  variables  (Algorithm  3.3).  The  ratios  in 
the  last  column  denote  the  sizes  of  SMDDs  with  the 
pair  sifting  to  the  sizes  of  SMDDs  with  the  normal  sift¬ 
ing.  The  CPU  time  for  the  pair  sifting  in  the  column 
8  includes  not  only  the  time  for  the  variable  ordering, 
but  also  the  time  for  finding  the  pairs  of  4- valued  input 
variables  from  MDDs.  CPU  time  for  both  siftings  in 
Table  4.3  shows  that,  the  pair  sifting  is  faster  than  the 
normal  sifting.  Experimental  results  show  that  MDDs 
are  usually  larger  than  SMDDs.  Sometimes,  MDDs  are 
much  larger  than  SMDDs.  For  example,  the  size  of  the 
MDD  for  e64  is  997,  while  the  size  of  the  SMDD  for 
e64  is  110.  In  addition,  Table  4.3  shows  that,  for  many 
benchmark  functions,  SMDDs  with  the  pair  sifting  are 
smaller  than  SMDDs  with  the  normal  sifting.  For  ac- 
cpla,  apexl,  apex5,  mish,  seq,  xl,  and  xparc,  SMDDs 
with  the  normal  sifting  are  smaller  than  SMDDs  with 
the  pair  sifting.  Sometimes,  the  both  siftings  have  the 
same  size,  e.g.  e64.  Note  that  apex5  is  one  of  the  most 
complex  benchmark  functions  in  Table  4.3,  and  our  pro- 
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Table  4.1:  Comparison  of  the  sizes  of  SBDDs  and  Table  4.2:  Comparison  of  the  sizes  of  SMDDs  obtained 


SMDDs. 


by 


Function 

name 

In 

Out 

SBDD 

SMDD 

Our 

method 

best 

average 

worst 

alu2f 

10 

8 

70 

40 

32 

49.9 

63 

brl 

12 

8 

85 

37* 

37 

60.8 

71 

dc2 

8 

7 

72 

29* 

29 

38.5 

50 

dist 

8 

5 

158 

85 

56 

102.6 

130 

f51m 

8 

8 

76 

33* 

33 

46.7 

62 

fout 

6 

10 

141 

79 

47 

96.3 

112 

max512 

9 

6 

184 

65* 

65 

121.8 

153 

misexl 

8 

7 

44 

20 

15 

24.9 

32 

mlp4 

8 

8 

150 

70 

53 

114.2 

138 

newapla2 

6 

7 

24 

8* 

8 

16.7 

21 

t3 

12 

8 

68 

26* 

26 

39.4 

55 

z5xpl 

7 

10 

79 

38 

34 

48.6 

60 

In:  number  of  inputs;  Out:  number  of  outputs. 


*Our  method  obtained  the  best  solution, 
■(function  with  don’t  cares. 


two  different  methods. 


Function 

name 

In 

Out 

SBDD 

SMDD 

Our 

method 

Sasao 

(2] 

alu2 

10 

8 

70 

40 

44 

alu^ 

14 

8 

525 

365 

389 

apla 

10 

12 

115 

73 

67 

DW 

5 

28 

137 

60 

69 

dk27 

9 

9 

35 

24 

22 

dukei 

l 

22 

29 

366 

297 

281 

misexl 

8 

7 

44 

20 

26 

misex2 

25 

18 

100 

69 

80 

misex3 

14 

14 

557 

310 

322 

rd53 

5 

3 

23 

11 

11 

rd73 

7 

3 

45 

17 

17 

rd84 

8 

4 

60 

24 

24 

sao2 

10 

4 

85 

43 

49 

sqr8 

8 

16 

250 

118 

132 

ual 

14 

8 

701 

355 

380 

vg2 

25 

8 

90 

62 

50 

Table  4.3:  Comparison  of  the  sizes  of  MDDs  and  SMDDs  (SMDDs  with  the  normal  sifting  and  SMDDs  with  the 
pair  sifting). 


Function 

name 

In 

Out 

MDD 

SMDD 

sift 

time1 

psift 

time^ 

ratio3 

accpla 

50 

69 

1623 

1065 

74.25 

1134 

52.10 

1.06 

apexl 

45 

45 

1845 

1109 

56.90 

1245 

35.15 

1.12 

apex5 

117 

88 

1084 

527 

160.71 

582 

115.38 

1.10 

e64 

65 

65 

997 

110 

38.10 

110 

25.32 

1.00 

ex4 

128 

28 

1936 

1487 

35.44 

1435 

20.57 

0.96 

misg 

56 

23 

107 

55 

1.66 

47 

0.73 

0.85 

misn 

94 

43 

165 

103 

4.95 

127 

1.16 

1.23 

opa 

17 

69 

519 

235 

0.15 

178 

0.03 

0.75 

seq 

41 

35 

1262 

730 

12.25 

812 

9.15 

1.11 

soar 

83 

94 

1053 

612 

6.26 

520 

4.71 

0.84 

xl 

51 

35 

981 

776 

29.46 

790 

21.85 

1.01 

x2dn 

82 

56 

315 

142 

5.13 

101 

4.05 

0.71 

x3 

135 

99 

1185 

715 

112.50 

682 

87.35 

0.95 

x7dn 

66 

15 

1016 

490 

15.02 

351 

10.61 

0.71 

xparc 

41 

73 

2644 

1012 

65.70 

1201 

43.50 

1.18 

1.  CPU  time  in  sec  on  a  JU 1/170  to  make  variable  orderings  for  SMDDs  using  the  normal  sifting. 

2.  CPU  time  in  sec  on  a  JU1/170  to  make  variable  orderings  for  SMDDs  using  the  pair  sifting. 

„  _  sizejSMDD)  using  the  pair  sifting 

O.  ratlO  Bize(S M DD)  using  the  normal  lifting  ' 


gram  required  8.52  CPU  seconds  to  convert  the  2-valued 
inputs  into  the  4- valued  ones  by  using  Algorithm  3.1  on 
a  JU1/170  with  160  MB  of  main  memory  (Sun  Ultral- 
170  compatible). 

From  the  experimental  results  (tables  are  not  included), 
we  also  obtained  the  sizes  of  SBDDs  and  SMDDs  to 
represent  wgt  n  (Definition  2.2)  and  inc  n  (Definition 
2.3).  We  observed  that,  for  2  <  n  <  11, 

size(SBDD,vugt  n)=n2  -  n  +  |k>g2  nj  +  1, 
size(SMDD,  wgt  n)=n  [log2  nj  +  n  -  2^°Sj  , 

size(SBDD,inc  n)=3n  —  2,  and 


size(SMDDJinc  n)=2n  —  1, 
where  n  is  the  number  of  2-valued  input  variables,  and 
[aj  denotes  the  largest  integer  not  greater  than  a.  In 
addition,  we  observed  that,  for  n  >  4, 

size(SMDD,wgt  n)<^size(SBDD,wgt  n), 
and  for  n  >  10 

size(SMDD,wgt  n)<-size(SBDD,wgt  n). 
o 

5  Conclusions  and  Comments 

In  this  paper,  we  proposed  a  method  to  represent 
multiple-output  functions  using  shared  multiple-valued 
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decision  diagrams  (SMDDs).  We  also  presented  algo¬ 
rithms  to  pair  the  input  variables  of  binary  decision 
diagrams  (BDDs),  and  to  find  good  orderings  of  the 
multiple-valued  variables  in  the  SMDDs.  We  derived 
the  sizes  of  SMDDs  for  general  functions  and  symmetric 
functions.  Experimental  results  showed  that,  in  many 
cases,  the  size  of  the  SMDD  is  a  half  that  of  the  shared 
binary  decision  diagram  (SBDD).  For  many  benchmark 
functions,  SMDDs  with  the  pair  sifting  are  smaller  than 
SMDDs  with  the  normal  sifting.  We  also  compared  our 
method  with  other  one.  Algorithms  3.1  and  3.3  can 
be  extended  to  group  k  input  variables,  where  k  >  2. 
SMDDs  are  useful  in  many  applications  such  as  de¬ 
sign  of  multiplexer-based  networks  and  design  of  pass- 
transistor  logic  networks  [9,  13,  14]. 
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Abstract 

We  introduce  a  new  implementation  of  a  ternary  adder 
with  four  inputs  and  two  outputs .  This  ternary  adder 
reduces  the  number  of  digits  in  a  multiplication  compared 
with  a  binary  multiplication.  One  advantage  of  the  ternary 
adder  is  that  four  instead  of  three  inputs  within  a  binary 
representation  will  be  summed  up.  In  this  paper  we  will 
compare  the  complexity  of  binary  against  ternary  multi¬ 
pliers.  Timing  diagrams  will  be  given  for  the  binary  and 
the  ternary  case  with  an  optimal  order  of  the  adder  in¬ 
puts.  Finally ,  we  present  a  ternary  carry  look-ahead  cir¬ 
cuit  for  a  further  reduction  of  total  time  delay. 

1.  Introduction 

In  this  paper  we  address  the  problem  of  multiple-valued 
multiplications.  First,  we  will  give  an  overview  about 
recent  approaches  to  optimize  the  multiplication  procedu¬ 
re.  In  the  1950’s  first  investigations  for  the  use  of  quasi- 
ternary  number  representations  were  made.  The  basic  idea 
was  presented  by  Booth  in  1951  [1].  His  idea  was  to 
combine  two  binary  digits  and  to  recode  these  into  signed 
digits.  The  aim  of  his  work  was  to  develop  a  uniform 
algorithm  for  multiplication  of  two  signed  binary  num¬ 
bers.  The  disadvantage  consists  in  an  unfavorable  beha¬ 
viour,  if  sequences  of  "01"  occur.  In  this  case  the  recoded 
quasitemary  number  has  more  covered  digits  than  the 
original  one.  A  decisive  alteration  in  the  recoding  structu¬ 
re  from  two  to  three  bits  by  MacSorley  [11]  led  to  the 
fact,  that  the  modified-Booth  recoding  forms  the  most 
common  recoding  structure  for  binary  multipliers.  It  is  an 
advantage  of  this  recoding  algorithm  that  at  most  half  of 
the  digits  can  be  covered.  By  a  corresponding  design, 
taking  these  properties  into  account,  this  leads  to  a  consi¬ 
derable  saving  in  partial  products  when  using  it  in  a 
multiplier.  In  spite  of  the  quasitemary  recoding,  the 
realization  in  a  hardware  is  carried  out  in  established 
binary  logic  [5]. 

In  parallel,  circuits  using  multiple-valued  logic  (MVL) 
were  developed  [7].  In  recent  years,  a  number  of  papers 
combining  MVL-techniques  and  recoding  algorithms  for 


the  development  of  multipliers  have  been  presented.  Such 
a  combination  was  given  by  Chen  and  Rajashekhara  [2]  in 
CMOS  technique.  But,  the  complexity  for  realizing  the 
multiplier  is  very  high,  due  to  the  fact  that  the  chosen 
radix  is  2  and  only  adders  with  two  inputs  are  used.  A 
mixed  multiple-valued  /binaiy  approach  was  presented  by 
Etiemble  and  Navi  using  current  mode  circuits  [4].  By 
special  3BC-cells  (3-valued  to  binary  current-mode  con¬ 
verters),  ternary  inputs  will  be  recoded  to  binary  outputs 
that  will  be  summed  up  to  multiple-valued  currents.  While 
all  approaches  presented  so  far  work  with  radix  2  instead 
of  higher  radixes,  Kameyawa  et  al.  [10]  gave  an  algo¬ 
rithm  for  multiplication  in  radix  4.  Also  the  circuits  of 
Ishizuka  et  al.  [9]  operate  to  radix  4  using  current  mode 
circuits  and  a  radix-4  redundant  number  system.  Ibrahim 
and  Abdul-Karim  [8]  developed  a  radix  3  2x2  trit  multi¬ 
plier  in  CMOS.  Besides  the  fact  that  the  introduced  ad¬ 
ders  are  too  extensive,  they  use  the  ternary  digital  COS¬ 
MOS  technique,  introduced  by  Mouftah  and  Jordan  [12], 
But,  the  usage  of  resistors  is  of  no  significance  for  todays 
VLSI-realizations. 

None  of  the  approaches  presented  makes  use  of  all 
advantages  of  the  MVL-technique.  On  the  one  hand,  this 
is  based  on  the  usage  of  a  redundancy  in  the  number 
representation  in  combination  with  a  suboptimal  radix.  On 
the  other  hand,  instead  of  an  optimal  radix,  the  possible 
number  of  inputs  for  an  adder  will  not  be  exhausted.  An 
early  attempt  with  an  optimal  combination  of  number  of 
adder-inputs  and  radix  was  done  by  Vranesic  and  Hama- 
cher  in  ternary  multi-threshold  logic  [13].  The  ternary 
multiplier  worked  directly  to  radix  3.  The  introduced 
adder  consists  of  four  inputs  and  accomplished  a  four-to- 
two  reduction  of  summands.  Since  every  output  has  to  be 
connected  to  an  input  of  a  following  adder  -  providing 
that  the  output  is  not  part  of  the  result  -  this  extension 
leads  to  a  considerable  saving  of  adders  compared  to  a 
binary  technique.  A  promising  approach  was  presented  by 
De  and  Sinha  [3]  in  I2L-technique.  Besides  an  optimal 
combination  of  number  of  inputs  and  radix,  they  move  on 
by  parallelism  through  precarry  operations  that  speed  up 
the  multiplication.  In  this  paper  we  present  ternary  multi¬ 
pliers  using  optimal  radix  3,  a  non-redundant  encoding 
scheme  and  adders  with  up  to  four  inputs. 
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In  the  following  section  we  will  derive  some  equations 
needed  for  the  estimations  in  the  later  sections.  In  section 
3  the  sum-  and  the  carry-circuit  of  a  ternary  4-input  adder 
as  well  as  a  simulation  result  is  presented.  Section  4 
compares  the  complexity  of  multiplication  circuits  using 
different  kinds  of  adders.  The  timing  behaviour  of  the 
resulting  multiplication  circuits  is  analyzed  in  section  5. 
For  further  improvements,  ternary  carry  look-ahead  cir¬ 
cuits  are  presented  in  section  6.  The  main  results  are 
finally  summarized  in  the  conclusion. 

2.  Basic  relations 


In  the  following  we  assume  that  both,  multiplicand  and 
multiplicator  U  and  V,  have  the  same  wordlength  n.  This 
is  no  common  limitation,  but  simplifies  the  derived  equa¬ 
tions.  Both  numbers,  U  and  V,  have  the  representation 


U  =  u, 
V  =  v. 


*n-r 


vn-  2* 


(1) 


whereby  zt,  and  vi  can  take  the  values  0  and  ±1.  The 
representation  of  U  and  V  in  (1)  is  assumed  to  radix  3. 
Each  bit  position  i  of  U  and  V  corresponds  to  a  so-called 
weight  which  is  3j  in  the  ternary  case.  For  the  correspon¬ 
ding  decimal  value  follows 


U  =  J>,  3' 
1=0 
n- 1 

V  =  £v,.3f 

f=0 


(2) 


The  representation  in  (2)  illustrates  that  the  ternary  ex¬ 
pression  is  easier  than  a  binary  one,  due  to  the  ternary 
values  0  and  ±1.  The  multiplication  of  U  and  V 
(M=U-V)  leads  to  equation  (3)  [3] 

<3) 

t*0  j- 0  i*yi  j=i-n+\ 


The  multiplication  of  ufy  will  be  called  a  partial  pro¬ 
duct  pp(i+j).  For  the  ternary  case  this  partial  product  can 
only  lie  within  the  range  -1  to  +1.  Usual  adders  sum  up 
several  inputs  of  the  same  weight  and  produce  two  out¬ 
puts,  one  at  the  same  and  one  at  the  next  higher  bit  posi¬ 
tion.  As  we  work  to  radix  3,  the  result  can  lie  within  the 
range  -4  to  +4.  This  means,  however,  that  a  ternary 
adder  can  process  four  inputs  of  the  same  bit  position  in 
contrast  to  three  inputs  of  a  binary  adder.  A  further  ad¬ 
vantage  of  the  ternary  number  representation  is  the  treat¬ 
ment  of  the  sign,  because  it  causes  no  additional  expense. 
Desisting  from  the  recoding  algorithms,  mentioned  in  the 
introduction,  in  binary  technique  we  have  to  spend  one  bit 
more  to  accommodate  the  sign.  By  consideration  of  the 
sign  bit,  we  can  derive  the  relation  for  a  binary  represen¬ 
tation  with  the  same  resolution  as  a  ternary  representation 
to  be 


b  =  tt-log-(.-*  +  1  »  n-1.58  +  1  (4) 

log(2) 


whereby  b  gives  the  number  of  digits  of  a  binary  repre¬ 
sentation.  With  increasing  n,  the  number  of  partial  pro¬ 
ducts  in  a  multiplication  increases  naturally.  Independent 
of  the  radix,  pp(i)  partial  products 


i+l 

2n-M 


for  0£i<n-l 
for  n<i<2n-2 


(5) 


have  to  be  processed  for  every  bit  position.  Summed  up 
from  the  starting  index  0  to  the  complete  wordlength  of 
the  n-n  multiplication  result  2n-2  it  follows 

2o-2 

PP  =  =  «2  (6) 

i=0 

The  higher  number  of  digits  of  a  binary  representation 
consequently  influences  the  number  of  partial  products  PP 
quadratically.  On  the  other  hand,  the  expense  of  ternary 
gates  is  much  higher  than  that  of  binary  ones.  Firstly,  the 
following  analysis  will  show  in  detail,  whether  the  ternary 
technique  is  advantageous  in  contrast  to  the  binary  one  or 
not.  For  this  analysis,  we  have  to  develop  a  ternary  adder 
with  four  inputs  and  two  outputs.  Secondly,  we  will 
analyze  the  effects  of  a  reduction  from  k  inputs  to  l  out¬ 
puts  with  k>l  for  adders  with  k=2  to  4  inputs.  Since 
every  output  of  an  adder  has  to  be  connected  with  an 
input  of  a  following  adder  -  except  this  output  is  part  of 
the  result  -  it  follows  a  nonlinear  relation  between  the 
number  of  adders  and  the  wordlength  n.  This  relation 
should  be  analyzed  in  dependence  on  the  types  of  adders. 
With  this  analysis  we  can  compare  the  complexity  of  both 
binary  and  ternary  multipliers. 


3.  TDDNL  4-input  adder 

As  a  temaiy  switching  technique  we  have  choosen  the 
ternary  dynamic  differential  no  race  logic  (TDDNL) 
technique  [6] .  The  realization  of  logical  operations  with 
one  or  two  inputs  as  well  as  connection  circuits  to  the 
binary  system  are  given  in  [6] .  The  design  of  a  TDDNL- 
adder  with  three  inputs  is  also  shown  there.  This  design 
should  be  expanded  to  four  inputs. 

TDDNL-gates  use  a  logical  array  of  nmos-transistors 
with  three  outputs,  fn ,  fv  and  fpi  i.e.,  a  negative,  zero, 
and  a  positive  output.  In  the  evaluation  phase,  exactly  one 
of  these  three  outputs  must  be  connected  with  the  source 
M.  Thus,  the  development  of  optimal  arrays  is  an  optimi¬ 
zation  problem  that  can  be  solved  by  existing  software 
tools.  In  contrast  to  that,  the  presented  method  for  desi¬ 
gning  the  logic  arrays  is  heuristically.  With  this  heuristi- 
cal  method  we  get  optimal  solutions  for  the  derived  adder 
as  well  as  for  the  carry  look-ahead  gate  presented  in 
section  6. 
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Fig.  1:  Driver  using  ternary  dynamic  differential  no 
race  logic  (TDDNL)  from  [6] 


The  heuristical  method  for  the  determination  of  the 
logical  array,  with  every  variable  used  having  a  fixed 
weight,  can  be  described  as  follows: 

Each  variable  will  be  worked  out  in  a  step.  Firstly ,  if 
variables  have  differing  weights ,  we  have  to  decide ,  which 
variable  is  to  be  used  first.  If  all  variables  are  equal  in 
weight ,  the  order  of  the  variables  is  of  no  importance.  In 
the  first  step ,  all  combinations  of  the  first  variable  will  be 
covered.  All  nodes  arising  from  this  step  will  be  combined 
with  all  three  combinations  of  the  next  variable.  In  the 
following ,  the  actual  sums  will  be  written  to  the  new 
arising  nodes.  The  sums  will  be  calculated  by  the  sum  of 
the  two  variables  processed  up  to  this  point.  All  nodes 
with  the  same  sum  can  be  connected  together.  For  a 
further  simplification  attention  must  be  paid  to  the  fact, 
whether  a  node  eventually  represents  a  fixed  result.  That 
means  that  the  result  is  not  depending  on  the  following 
variables.  In  this  case,  this  node  can  be  connected  to  the 
corresponding  output.  All  other  nodes  will  be  processed  in 
the  same  manner  as  described ,  until  all  variables  have 
been  used. 

The  following  development  of  a  4-input  sum-  and 
carry-circuit  elucidates  this  method.  In  case  of  a  ternary 
adder  with  four  inputs  the  four  variables  w,  x,  y,  and  z 
have  to  be  processed.  The  order  of  the  variables  is  of  no 
significance  because  the  four  inputs  have  the  same  weight. 
After  each  node,  we  will  form  a  sum  that  is  composed  of 
the  complete  path  to  the  source-node  M.  In  the  case  of  a 
ternary  carry -signal  in  fig. 2,  the  sum  after  the  first  varia¬ 
ble  w  at  the  left  node  is  -1,  at  the  middle  node  it  is  0  and 
at  the  right  node  it  is  +1.  These  three  nodes  will  be 
combined  with  the  three  combinations  of  x.  The  sums  at 
the  left  node  connected  with  x  will  be  -2,  with  x°  -1  and 
connected  with  x  it  will  be  0.  After  this  procedure  is  done 
for  all  9  cases,  we  look  for  nodes  with  equal  weights.  In 
the  case  of  the  carry  signal  we  can  reduce  the  nine  nodes 


to  five,  according  to  four  cases  where  the  sums  are  equal. 
The  connection  of  the  nodes  means  that  the  result  is 
independent  on  the  way,  how  this  sum  is  obtained.  Espe¬ 
cially,  an  intermediate  sum  of  1  is  independent  how  it 
comes  about,  either  by  the  combination  w= 1  /  x=0  or  by 
w=0  /  x=l.  The  number  of  nodes  that  require  further 
processing  is  minimal  with  this  method.  Every  one  of  the 
five  nodes  after  x  and  w  with  the  sums  in  the  range  from 
-2  to  2,  must  be  combined  with  the  three  combinations  of 
y.  The  connection  of  nodes  with  the  same  weight  after  y, 
results  in  seven  nodes  in  the  range  from  -3  to  +3.  Becau¬ 
se  of  a  negative  carry  having  to  be  set  if  the  sum  is  equal 
or  less  than  -2,  the  node  with  the  sum  -3  can  be  connec¬ 
ted  with  the  output  Can  in  fig. 2.  Similar  considerations 
eliminate  the  node  with  the  sum  +3.  The  remaining  five 
nodes  have  to  be  combined  with  all  combinations  of  z . 
The  logical  array  obtained  by  this  method  has  a  least 
number  of  transistors  for  this  problem.  As  a  further  pro¬ 
perty  of  this  method,  no  states  will  be  propagated  back  in 
the  network  that  could  cause  a  wrong  result  (compare 
fig.  12  in  [6]). 


Fig.  2:  Logical  array  of  a  ternary  carry-signal 


Fig.  3:  Logical  array  of  a  ternary  sum-signal 


The  logical  array  for  the  sum  is  still  more  favorable 
than  that  of  the  carry,  because  the  modulo-3-operation 
takes  care  for  the  property  that  the  intermediate  sums 
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cannot  exceed  the  range  -1  to  +1.  Regardless  of  the 
number  of  variables  used,  almost  three  nodes  after  every 
step  have  to  be  processed  further.  Both  logical  arrays,  for 
the  sum  and  for  the  carry,  are  given  in  fig.2  and  3.  The 
carry-signal  needs  39  nmos-transistors,  whereby  the  sum- 
circuit  uses  only  30  transistors.  These  circuits  have  to  be 
expanded  by  the  TDDNL-driver  in  fig.  1. 

4.  Complexity  of  multipliers 


We  can  not  explain  our  analysis  in  detail  at  this  point. 
But,  the  results  in  fig.5  show  significant  advantages  of 
ternary  multiplications  against  binary  ones. 
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Fig.  4:  Complexity  of  multipliers  with  n  digits  to  base  3; 
half  adder  HA:  2  inputs;  full  adder  FA:  3  inputs;  four 
input  adder  FI:  4  inputs;  index  is  radix 

For  the  detailed  analysis,  we  have  made  the  same  as¬ 
sumptions  as  in  [6] .  According  to  this  analysis  the  follo¬ 
wing  complexity  results  for  the  different  adder  types: 
binary  adder  with  2  inputs:  complexity  54  and  30  transi¬ 
stors;  binary  adder  with  3  inputs:  complexity  60  and  36 
transistors;  ternary  adder  with  2-inputs:  complexity  90 
and  58  transistors;  ternary  adder  with  3  inputs:  complexi¬ 
ty  127  and  91  transistors;  ternary  adder  with  4  inputs: 
complexity  159  and  119  transistors. 

Fig. 4  shows  the  relations,  standardized  to  the  number 
of  digits  n  as  an  exponent  to  base  3.  For  n=32,  e.g.,  a 
resolution  of  more  than  1 .853*  1015  follows  for  the  ternary 
case,  whereby  in  a  binary  representation  according  to  (4) 
nearly  52  bit  have  to  be  provided.  From  the  above  formu¬ 
las  a  complexity  of  79200  results  for  a  32-trit-multiplier. 
This  is  equal  to  60032  transistors. 

5.  Timing  behaviour 

Fig. 4  reflects  a  comparison  with  respect  to  the  com¬ 
plexity  of  binary  and  ternary  multipliers.  Regarding  to  the 
timing  behaviour  we  can  expect  the  ternary  multiplier  to 
be  superior  in  contrast  to  binary  multipliers,  because 
considerably  less  steps  have  to  be  processed  up  to  the 


final  result.  With  respect  to  the  switching  delays, 
TDDNL-gates  can  be  compared  to  have  a  similar  delay  in 
contrast  to  binary  dynamic  gates.  This  is  obvious,  if  we 
consider  the  logic  block  in  fig. 2  that  is  built  by  n-channel 
MOSFETs  only,  whereby  a  maximal  number  of  four 
transistors  is  connected  in  series.  Besides  a  faster  swit¬ 
ching  property  against  p-channel  MOSFETs,  this  means  a 
lower  input  capacity  too.  For  the  TDDNL-gates  only  the 
propagation  delay  of  a  TDDNL-driver  (fig.l)  has  to  be 
added.  But,  this  delay  is  formed  by  the  switching  of  only 
one  p-channel  transistor. 
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Fig.  5:  Propagation  dela; 
ordering  of  summands  in 
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Fig.  6:  Propagation  delay  according  to  an  opti¬ 
mal  ordering  of  summands  in  a  ternary  multi¬ 
plier  (t—2) 

For  a  more  detailed  analysis,  we  will  make  the  follo¬ 
wing  assumptions:  All  partial  products  will  be  formed  in 
a  standardized  delay  time  t—  1.  The  propagation  delay  for 
a  binary  dynamic  gate  will  also  be  r=  1  from  every  input 
to  both  the  sum-  and  the  carry-output.  To  demonstrate  the 
advantages  of  the  ternary  technique,  we  assume  the 
TDDNL-gates  to  have  a  propagation  delay  of  r=2  from 
any  input  to  any  output. 

In  order  to  optimize  the  circuit  regarding  to  its  timing 
behaviour,  we  will  introduce  a  new  order  technique.  This 
technique  optimizes  the  timing  behaviour,  but  complicates 
the  layout  of  the  complete  circuit.  However,  we  will  keep 
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this  in  the  background  since  efficient  algorithms  exist  for 
solving  these  problems.  The  new  order  technique  functi¬ 
ons  as  follows: 

All  partial  products  will  be  ordered  according  to  their 
magnitudes  for  each  weight  and  will  be  written  into  a  list. 
An  adder  processes  the  2,  3  or  4  (in  proportion  to  the 
adder  used)  fastest  products  and  forms  two  output  signals , 
called  intermediate  sums  is(i).  The  magnitude  of  these 
outputs  is  equal  to  the  largest  magnitude  of  the  inputs  plus 
the  standardized  propagation  delay  as  specified.  The  used 
inputs  are  cancelled  from  the  list  and  the  new  two  magni¬ 
tudes  are  written  into  the  list,  which  will  be  ordered  aga¬ 
in .  This  procedure  will  be  processed  from  the  LSB  to  the 
MSB,  until  only  one  entry  remains  for  every  weight. 

Fig.5  shows  the  behaviour  for  adders  with  three  inputs 
(a)  which  form  an  input-to-output  reduction  from  3  to  1/1 
(sum/carry).  Fig. 6  shows  the  behaviour  with  four  input 
adders  which  form  an  input-to-output  reduction  from  4  to 
1/1.  The  numbers  on  the  left  hand  of  the  ellipses  show 
the  calculated  propagation  delay  that  has  to  be  written  to 
the  same  and  to  the  next  higher  weight.  This  number  has 
to  be  calculated  by  adding  the  time  delay  r  to  the  biggest 
number  in  the  ellipse. 


Fig.  7:  Timing  behaviour  of  a  26  bit  binary 
multiplier 

We  observe  that  the  propagation  delay  of  a  ternary 
multiplier-result  increases  by  two  whereby  that  of  the 
binary  result  increases  by  one  per  weight.  But,  the  crucial 
difference  is  that  we  can  use  a  carry  look-ahead  method 
for  much  lower  weights  in  the  ternary  case  in  contrast  to 
the  binary  case.  This  is  based  on  the  fact  that  only  one 
intermediate  sum  has  this  high  propagation  delay.  All 
other  intermediate  sums  is(i)  are  stable  much  earlier. 
Thus,  a  carry  look-ahead  technique  can  be  used  in  the 
ternary  case  from  the  lowest  bits  up  to  the  MSB  of  the 
result. 

Fig. 8  shows  the  behaviour  of  a  ternary  16-trit  multi¬ 
plier  in  contrast  to  a  binary  26-bit  multiplier  in  fig. 7, 
which  means  nearly  the  same  resolution  for  both  cases. 
The  arrows  mark  the  weight  from  that  onwards  the  usage 


of  a  CLA-circuit  is  sensible.  Desisting  from  the  carry- 
signal  that  is  rippled  through  the  complete  wordlength,  the 
largest  standardized  intermediate  sum  is  1 1  in  the  ternary 
case  in  contrast  to  26  in  the  binary  case.  This  is  in  spite 
of  the  fact  that  the  estimated  propagation  delay  of  the 
ternary  adder  is  twice  as  large  as  that  of  the  binary  adder. 
Furthermore,  it  has  to  be  considered  that  the  wordlength 
of  the  ternary  result  is  reduced  by  a  factor  of  1 .58  compa¬ 
red  to  the  binary  result. 


Fig.  8:  Timing  behaviour  of  a  16  TRIT  ter¬ 
nary  multiplier 

The  concluding  chapter  describes  the  development  of 
the  ternary  CLA-circuits  that  can  be  used  with  the  ternary 
multiplier  circuits. 

6.  TDDNL  carry  look-ahead  technique 

TDDNL-gates  have  the  property  that  the  precharge- 
phase  concludes  with  an  invalid  combination  of  the  out¬ 
put.  Whether  the  output  has  to  be  -1,  0  or  1,  the 
TDDNL-gate  has  to  change  its  state.  Thus,  the  carry 
signal  ripples  through  all  stages  connected  in  series, 
independent  of  the  states  of  the  variables.  A  following 
gate  can  switch  only  when  the  preceding  gate  has  a  valid 
output  combination.  As  a  result,  the  total  propagation 
delay  depends  on  the  number  of  gates,  connected  in  se¬ 
ries.  Thus,  the  aim  of  the  CLA-logic  is  to  speed  up  this 
procedure  by  connecting  the  function  of  two  or  more 
adders  into  one  special  structure. 

The  basic  carry  look-ahead  principle  was  developed  by 
Weinberger  and  Smith  in  1956  [14]  for  binary  logic.  But, 
we  refrain  from  the  usage  of  generate-  and  propagate- 
signals  as  used  in  this  early  work,  because  this  would 
require  two  additional  TDDNL-gates.  Instead,  we  will  use 
the  carry-  and  the  sum-outputs  of  the  circuits  directly.  For 
the  development  of  a  ternary  carry  look-ahead  circuit  we 
assume  that  2  bits  per  weight  have  to  be  processed  in  a 
final  adder. 

Fig. 9  shows  the  CLA-structure  with  3  inputs  cini,  cin, 
and  s(  with  the  weight  1  and  two  inputs  ci+1  and  si+]  with 
the  weight  3.  The  logical  switching  circuit  must  connect 
the  output  cout  n  with  the  source  in  the  case  of  a  sum  equal 
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or  less  than  -5  (=-9+3  +  1).  The  output  coutp  has  to  be 
connected  in  the  case  of  a  sum  equal  or  greater  than  +5 
(=9-3-1).  In  the  last  case,  the  output  coutz  has  to  be  con¬ 
nected  with  source,  if  the  sum  of  the  five  inputs  lies  in 
the  range  -4  to  +4.  The  development  of  the  circuit  is 
conformed  to  the  heuristic  method  described  earlier.  To 
speed  up  the  switching  of  this  circuit,  the  time  critical 
variable  cini  is  connected  to  the  output  as  closely  as  possi¬ 
ble. 


Fig.  9:  Block  diagram  of  a  3-TRIT 
CLA 

Finally,  fig.  10  shows  the  complete  logic  of  a  3-trit 
CLA-circuit.  As  an  argument  against  such  a  structure  one 
can  remark  that  the  switching  delay  of  seven  transistors  in 
series  is  very  high.  But,  six  of  the  seven  inputs  are  stable 
much  earlier  than  the  critical  input  cin ,. 


coutn  coutz  coutp 


Fig.  10:  Switching  circuit  of  a  3-TRIT  CLA,  groups  of 
3  transistors  connected  with  x,  x°  und  x 


7.  Conclusion 

We  have  derived  formulas  for  the  number  of  adders  in 
an  ti  n  bit  binary  and  ternary  multiplication  structure.  The 
complexity  for  a  binary  array-multiplier  is  nearly  twice  as 
large  as  that  for  a  ternary  multiplier  with  a  corresponding 
resolution.  In  addition,  we  have  shown  the  advatages  of 
the  timing  behaviour  with  an  optimal  order  of  partial 
products  and  intermediate  sums.  The  presented  ternary 
CLA-circuits  can  be  used  to  speed  up  the  processing  of 
the  final  multiplication  result. 
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Abstract 

A  voltage-mode  neuron- MO  S(v  MO  S )  down  literal 
circuit  which  realizes  an  arbitrary  down  literal  func¬ 
tion  is  proposed.  It  provides  the  benefit  that  the  circuit 
can  be  easily  fabricated  by  standard  CM  OS  process ,  in¬ 
stead  of  the  multi-level  ion  implantation  applied  in  the 
conventional  circuit.  It  has  a  variable  threshold  volt¬ 
age  by  way  of  controlling  only  two  bias  voltages.  Its 
noise  margin  and  switching  sensitivity  are  greater  than 
those  of  variable-threshold  C-vMOS  inverter  presented 
by  Shibata.  The  threshold  voltage  errors  of  the  circuit 
caused  by  device  parameters  mismatch  is  also  analysed. 
Using  the  vMOS  down  literal  circuits,  literal  and  T- 
gate  circuits  are  also  presented.  Performances  of  the 
proposed  circuits  are  evaluated  using  HSPICE  simula¬ 
tions  with  MO  SIS  2.0  fim  CMOS  device  parameters. 

1  Introduction 

With  the  development  of  multi-valued  logic  (MVL) 
theory,  the  need  for  MVL  circuit  design  is  increased. 
There  are  two  kinds  of  MVL  circuit  generally  used: 
voltage-mode  and  current-mode.  Current-mode  MVL 
circuits  are  popular  with  many  researchers  for  their  ad¬ 
vantage  that  the  algebraic  weighted  sum  or  difference 
of  input  currents  can  be  created  easily,  requiring  in¬ 
active  or  passive  components[l][2].  However,  because 
they  are  current-mode,  power  consumption  problem  is 
serious.  Conversely,  voltage-mode  MVL  circuits  have 
the  advantage  of  low  power  consumption.  But  since 
they  require  the  multi-level  ion  implantation  process 
technology  to  realize  multi- threshold  voltages  in  MOS 
transistors,  complex  fabrication  and  high  cost  are  in¬ 
evitable  [1].  In  order  to  solve  these  problems,  we  try 
to  use  a  general  CMOS  process  to  realize  voltage-mode 
MVL  circuits. 

A  neuron-MOS  ( z/MOS )  transistor  is  a  kind  of  novel 
transistor  device  with  multi-input  gates.  It  was  pro¬ 


posed  by  Shibata  and  Ohmi  in  1992  [3].  The  z/MOS 
transistor  is  characterized  by  variable  threshold  voltage 
achieved  by  controlling  the  voltages  of  the  multi-input 
gates.  Applying  this  feature  to  voltage-mode  MVL  cir¬ 
cuits,  we  present  a  z/MOS  down  literal  circuit  in  this 
paper.  The  proposed  circuit  has  greater  noise  mar¬ 
gin  and  switching  sensitivity  than  Shibata’s  variable- 
threshold  C- z/MOS  inverter  that  can  also  realizes  down 
literal  function  in  MVL  [3]. 

Basing  on  z/MOS  down  literal  circuits,  literal  and  T- 
gate  circuits  can  be  easily  composed.  All  the  proposed 
circuits  are  verified  by  HSPICE  simulations  with  MO- 
SIS  2.0/zm  CMOS  device  parameters.  Because  z/MOS 
can  be  fabricated  by  the  standard  CMOS  process  with 
a  two- poly  layer,  the  z/MOS  volt  age- mode  MVL  cir¬ 
cuits  can  be  realized  without  the  multi-level  ion  im¬ 
plantation  process  technology.  The  ease  of  fabrication 
will  make  voltage-mode  MVL  circuits  more  practically, 
and  the  development  of  MVL  hardware  will  push  the 
MVL  research  work  forward. 

2.  z/MOS  down  literal  circuit 

2.1.  Neuron-MOS  (z/MOS)  transistor 

A  z/MOS  transistor  is  the  device  composed  of  a 
floating-gate  and  multi-input  gates  that  are  capac- 
itively  interacting  with  the  floating-gate.  Figure  1 
shows  the  illustration  of  z/MOS  transistor.  Co  is  the 
oxide  capacitance  between  the  floating-gate  and  the 
substrate.  In  the  case  of  a  fc-input  n-channel  z/MOS, 
capacitances  between  the  multiple  input  gates  and  the 
floating-gate  are  defined  as  Ci,C2-..Cfc  in  order  from 
the  drain  side.  When  the  floating-gate  to  source  volt¬ 
age  (Vps)  is  larger  than  the  threshold  voltage  (Vr  seen 
from  the  floating-gate,  the  z/MOS  switches  ON,  with 
the  saturation  drain-source  current  Ids  represented  as 
Eq.(l).  The  corresponding  transconductance  parame¬ 
ter  K ,  floating-gate  voltage  VF  and  capacitive  weight 
Wi  are  given  by  Eqs.(2-4)  [2]. 
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n-ChanneJ  vMOS  p-Channel  vMOS  substrate  (bulk) 


(a)  Symbolic  representation 


(b)  Capacitive  model 


Figure  1.  Description  of  Neuron-MOS 


I  OS  =  K{Vfs  -  VTf  , 

(1) 

1  w 

K  =  2»C°*T  ’ 

(2) 

k 

VF  =  , 

(3) 

1=1 

Ci 

Wi  =  - T - 

Co  +  Li=i  Cj 

(4) 

where  /z  is  the  electron  mobility,  Cox  is  the  gate  oxide 
capacitance  per  unit  area,  W  is  the  channel  width,  L  is 
the  channel  length,  Ci  is  the  capacitance  between  the 
floating-gate  and  ith  input  gate.  The  initial  charge  of 
the  floating-gate  is  assumed  to  be  zero. 

The  equation  of  Ids( Eq.(l))  can  be  transformed  to 
Eq.(5)  which  having  the  same  form  as  that  of  a  general 
MOS  transistor. 


Ids  =  K*{Vj  -Vs-  Vfj)2  , 

(5) 

where, 

K* 

T 

II 

(6) 

V*. 

VTj 

=  ^[VT  +  il-W^Vs-^WiVil 

•  (7) 

Prom  Eq.(7),  it  is  known  that  the  equivalent  threshold 
voltage  V£  seen  from  j-th  input  gate  can  be  varied  by 
controlling  the  other  input-gate  voltages.  Thereupon, 
in  one  chip  MOS  transistors  with  multiple  thresh¬ 
old  voltages  can  be  realized  by  using  z/MOS,  without 
multi-level  ion  implantation.  Employing  this  feature 
in  voltage-mode  MVL  circuit,  we  proposed  the  z/MOS 
down  literal  circuit. 

2.2.  Circuit  description 

A  down  literal  circuit  is  the  fundamental  element  in 
voltage-mode  MVL  circuits.  MVL  logic  functions  are 
almost  always  developed  from  the  down  literal  func¬ 
tion,  and  most  voltage-mode  MVL  circuits  consist  of 
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(a)  Circuit  description  (b)  Block  diagram 


Figure  2.  z/MOS  down  literal  circuit 


down  literal  circuits.  The  down  literal  function  is  given 
by  Eq.(8). 


Di(x)  =  {  J  1 


x  <  i 
x  >  i  +  1 


(8) 


where  i  €  {0, 1,  ...2?  —  2},  x  €:  {0,  —  1}  in 

jR- valued  system  [1].  The  conventional  voltage-mode 
CMOS  down  literal  circuit  is  composed  of  a  PMOS 
and  an  NMOS.  Its  structure  is  the  same  as  the  bi¬ 
nary  inverter.  But  the  threshold  voltages  of  PMOS 
and  NMOS  are  to  be  fixed  to  different  values  accord¬ 
ing  to  in  Eq.(8).  In  order  to  fabricate  down  literal 
circuits  in  one  chip,  multi-level  ion  implantation  is  nec¬ 
essary  to  implement  multi-threshold  voltages  of  MOS 
transistors,  which  increases  the  complexity  and  cost  of 
fabrication.  To  solve  this  problem,  variable-threshold 
transistor -z/MOS  is  considered  to  replace  the  general 
MOS  transistor  with  fixed  threshold  voltage.  A  z/MOS 
down  literal  circuit  that  can  be  fabricated  by  a  stan¬ 
dard  CMOS  process  is  proposed  here. 

The  z/MOS  down  literal  circuit  consists  of  one  two- 
gate  p-channel  v MOS  (z/PMOS)  and  one  two-gate  n- 
channel  z/MOS  (z/NMOS)  as  shown  in  Fig.2(a).  The 
common  input  to  z/PMOS  and  z/NMOS,  is  taken 
as  the  input  of  the  down  literal  circuit,  and  Hi  >  H2  are 
bias  voltages(Fig.2(a)(b)).  According  to  Eq.(7),  the 
equivalent  threshold  voltage  of  z/NMOS  transistor  (Hn) 
seen  from  the  input  Hn  is  given  by  Eq.(9),  and  the 
equivalent  threshold  voltage  of  z/PMOS  (H*)  is  given 
by  Eq.(10)  in  a  same  way. 

K  =  —vtn  -  ^lvb2  ,  (9) 
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v*  =  —vtp  -  ^vbl  - 

P  Wp2  WP2  Wp2 


-Vvd  ,  (10) 


where  wn(p)i ,  i  €  {1,2}  are  capacitive  weights  of 
z/NMOS(z/PMOS).  Therefore,  this  v MOS  down  literal 
circuit  can  be  taken  as  a  common  binary  CMOS  in¬ 
verter  with  threshold  voltage  V^n  in  NMOS  and  V*p  in 
PMOS  .  Because  V*n^  can  be  varied  by  setting  the 
threshold  voltages  the  inverter  has  different 

threshold  voltages,  which  satisfying  the  requirement  of 
down  literal  circuit  of  MVL. 


2.3.  Transfer  characteristic 


The  transfer  characteristic  of  a  down  literal  circuit  is 
shown  in  Fig.2(c).  VDD  =  (R- l)Vd,  Vd  is  unit  voltage. 
When  V{n  =  xVd  is  greater  than  the  circuit  threshold 
voltage  VTc  =  (*  +  0.5)Vd,  Vout  turns  from  VDD  to 
0.  The  key  of  MVL  down  literal  circuit  is  that  VTC 
should  have  various  values  according  to  in  Eq.(8). 
In  a  manner  analogous  to  a  binary  CMOS  inverter,  the 
transfer  characteristic  shown  in  Fig.2(c)  is  divided  into 
5  regions  accroding  to  the  operation  of  z/MOS  transis¬ 
tors.  [4] 

Region  A:  0  <  V*n  <  Vt*n 

In  this  region,  z/NMOS  is  OFF,  there  is  no  current 
flowing  over  the  circuit,  Vout  =  Vdd- 
Region  B:  Vt*n  <  Vin  <  VTC 

In  this  region,  z/NMOS  is  in  saturation  region,  and 
z/PMOS  in  linear  region. 

Region  C:  =  Vtc 

Both  z/NMOS  and  z/PMOS  operate  in  saturation  re¬ 
gion.  Their  saturation  currents  are  Ip  and  Jn,  respec¬ 
tively. 

In  =  K*n(Vin-Vt*n)2  ,  (11) 

IP  =  K;(Vin  -  VDD  -  V*)2  ,  (12) 


where  transconductance  parameters:  K*  =  Kpwpl  = 

^HoXp^fwp  1.  H-n  —  &nwnl  =  ^2  ^OX n 7^ Wnl1 
is  the  electron  mobility,  Coxp(n )  is  the  gate 
oxide  capacitance  per  unit  area,  Wp^n)/Lp(n)  is  the 
channel  width/length  of  i/P(N)MOS.  is  the 

equivalent  threshold  voltage  of  i/P(N)MOS  seen  from 
V5„(Eqs.(9)(10)). 

In  a  manner  analogous  to  binary  CMOS  inverter,  the 
down  literal  circuit  threshold  voltage  Vjc  is  defined  as 
in  this  C  region  [4].  and  it  can  be  derived  from 
Ip  =  -In- 


Vrc  = 


VpD  +  Vtp  +  \/K% 


1  +  V^I 

A"*  =  Kn  w»i 

R  K*  Kp  w2p2 


(13) 

(14) 


According  to  Eqs.(9)(10)(13)  and  (14),  if  there  are 
Kn  =  Kp,  \Vtp\  =  Vtn  and  w„ i,2  =  wpl<2  =  w  =  \  (ca¬ 
pacitances  between  input  gates  and  floating  gate  are 
identical  in  j/N(P)MOS,  Vrc  shown  in  Eq.(13)  can  be 
simplified  to 

Vrc  =  .  (15) 

Here,  the  restriction  of  Vbi,Vb2  is  given  by  Eq.(16) 
which  is  derived  from  saturation  condition  of  z/MOS 
transistors  in  the  circuit. 


-4Vin<Vbl-Vto<2VDD-4\Vtp\  .  (16) 

According  to  Eq.(15),  VTc  of  this  circuit  can  be  var¬ 
ied  by  controlling  the  values  of  the  bias  volt  ages  (V*>i, 
Vfe2),  and  different  down  literal  functions  can  be  imple¬ 
mented. 

In  this  region,  the  peak  current  (Ipeak)  is  reached, 
and  its  value  is 

Ipeak  =  ~(VTC  +  Vb2-2Vtn)2  .  (17) 

It  is  the  saturation  current  flowing  over  the  z/NMOS 
and  z/PMOS  in  Fig.2(a). 

Region  D:  Vtc  <  Vin  <  Vdd  +  Vt*p 

z/PMOS  is  in  saturation  region,  and  z/NMOS  in  lin¬ 
ear  region. 

Region  E:  Vin  >  VDD  +  Vt* 

In  this  region,  z/PMOS  is  cut-off,  Ip  —  0,  Vout  —  0. 
From  the  analysis  above,  it  is  known  that  the  z/MOS 
down  literal  circuit  operates  like  the  general  CMOS  in¬ 
verter.  The  z/MOS  down  literal  circuit  can  also  be  con¬ 
sidered  as  a  CMOS  inverter  in  which  threshold  voltages 
of  NMOS  and  PMOS  are  replaced  by  the  equivalent 
ones:  V£(Eq.(9))  and  V;*(Eq.(10)),  respectively.  The 
circuit  threshold  voltage  VTc  is  corresponding  to  the 
gate  threshold  voltage  in  the  general  inverter  [4].  Ac¬ 
cording  to  Eq.(15),  Vrc  of  the  proposed  circuit  can  be 
varied  by  controlling  the  values  of  the  bias  voltages(T4i, 
Vb2),  therefore  different  down  literal  functions  can  be 
realized  using  one  proposed  circuit. 

The  mismatch  of  device  parameter  will  lead  to  the 
error  on  Vtc ■  The  error  of  Vtc  Is  defined  in  Eq.(18). 

E  =  Vrc'  -  Vtc  ,  (18) 


where  Vj  C  is  threshold  voltage  of  a  circuit  with  device 
parameter  mismatch.  The  real  value  of  parameter  *  is 

V . 

(1)  In  the  case  of  wf  ^  Aw  —  wf  —  \ 


Ew  — 


-2Aw 
1  +  2  Aw 


Vdd  • 


(19) 
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(2)  In  the  case  of  V{n  *  \Vt'p\,  AVt  =  V‘n  -  \Vt'p\ 

Ey,  =  AV%  .  (20) 

(3)  In  the  case  of  K'R  /  1,  AKR  =  K'R  -  Kr 


EKr  =  (V1+9trfl  ~  (4Hn  ~  2Vdd  +  Hi  -  Vu)  ■ 

(21) 

The  errors  stated  above  can  be  overcome  by  adjust¬ 
ing  the  bias  voltages  V&i ,  Vfo  •  It  was  discussed  in  [5]. 

2.4.  Noise  margin 


Noise  margin  is  an  important  specific  to  a  down  lit¬ 
eral  circuit  in  MVL,  as  well  as  a  CMOS  inverter  in 
binary  logic.  Low  noise  margin  NMi  and  high  noise 
margin  NMh  are  defined  as  Eq.(22)  and  Eq.(23),  re¬ 
spectively. 

NMi  =  Vji  -  VOL  ,  (22) 

NMr  =  Voa  -  VIH  ,  (23) 

where  the  values  of  Vql  =  i  and  Von  —  i  +  1  in  down 
literal  function  Dj.  The  definitions  of  Vji  and  VIH  are 
shown  in  Fig.2(d).  They  can  be  solved  as  Eqs.(24)(25) 
by  the  same  way  as  those  in  CMOS  inverter.  [4] 


Vili  = 


WpD  +  W?p  +  SV?n 
8 


(24) 


V,„  -  ,  (25, 

where  Vt*  =  2Vtp  -  Vbi  +  VDD,  Vt*n  —  2Vtn  -  Vb2  accord¬ 
ing  to  Eqs.(9)(10),  and  w„li2  =  wpi,2  =  w 

The  variable-threshold  C-z/MOS  inverter  presented 
by  Shibata  [3]  has  the  same  function  as  the  z/MOS  down 
literal  circuit  proposed  in  this  paper.  However,  their 
noise  margins  are  different.  Vu,  Vjh  in  the  variable- 
threshold  C-z/MOS  inverter  are  given  by  Eqs.(26)(27), 
respectively. 


ViL2  = 


V///2  = 


3  Vdd  +  3  Vtp  +  5Vjn 
4 

5Vdd  +  §Vtp  +  3  Vtn 
4 


-  |(Hi  +  Vb2) 

-  |(Hi  +  vb2) 


(26) 

(27) 


Then  according  to  Eqs. (19-22),  the  differences  of  noise 
margins  between  the  circuit  of  Shibata  and  the  circuit 
proposed  here  can  be  given  by  Eqs.  (28)  (29). 


AN Ml  =  NML1  -  NMl2  =  —  _  —  ,  (28) 


A NMh  =  NMui  -  NMh 2  =  ■  61  b2  .  (29) 

8 

Therefore,  the  z/MOS  down  literal  circuit  has  greater 
noise  margin  than  the  circuit  proposed  by  Shibata  [3] 
only  if  setting  V&i  >  Vb2. 


Table  1.  Quaternary  down  literal 


Vdd  =  3  Vd 

Vtc 

Hi  + 

Hi 

Ha 

D0(x) 

0.5H 

5Vd 

3H 

2H 

Di(x) 

1.5H 

3H 

2H 

H 

D2{x) 

2.5H 

H 

H 

0 

2.5.  Quaternary  z/MOS  down  literal  circuit 

According  to  Eq.(15),  the  quaternary  down  literal 
circuit  can  be  designed  easily.  There  are  three  kinds 
of  down  literal  functions  in  the  quaternary  system. 
D{(x)(i  =  0,1,2)  can  be  attained  by  setting  V&i  and 
l  to  the  values  stated  in  Table  1. 


Figure  3.  Simulation  results  of  z/MOS  down  lit¬ 
eral  circuit 

Simulations  of  the  quaternary  z/MOS  down  lit¬ 
eral  circuit  were  done  using  HSPICE  with  MOSIS 
2.0//m  CMOS  device  parameters.  Under  the  condi¬ 
tions  of  Vd  =  IV,  Wn/Ln  =  32^m/4^in,  Wp/Lp  = 
80//in/4/zm  (to  make  Kp  =  Kn),  Vtn  —  \VtP\  = 
1.0V,  capacitances  in  z/MOS  0.8pF,  the  DC  analysis  of 
Dq(x),  D\(x),  D2{x)  are  shown  in  Fig.3(a).  The  gradi¬ 
ent  of  Vout  is  about  Vout/Vin  —  42.8.  The  simulation 
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of  Shibata’s  variable- threshold  C-z/MOS  inverter  was 
also  done  using  the  same  device  parameters  as  those 
of  v MOS  down  literal  circuit,  and  its  transfer  charac¬ 
teristic  is  also  shown  in  Fig.3(a).  The  noise  margin  of 
i/MOS  down  literal  circuit  is  about  0.13V  greater  than 
that  of  variable-threshold  C-z/MOS  inverter,  which  is 
accord  with  Eqs.(28)(29).  If  switching  sensitivity  is 
defined  as  S  =  ,  AVfW  =  Vin(Vout=97,5%vDD)  - 

Vin{Vout= 2.5%vdd),  there  are  5  =  12.95  in  the  z/MOS 
down  literal  circuit,  and  5  =  3.27  in  the  variable- 
threshold  C-z/MOS  inverter.  The  delay  time  of  the 
proposed  circuit  is  approximately  1.9  ns  (Fig.3(b)). 
The  peak  currents  of  D0(x),  Di(x),  D2{x)  are  shown 
in  Fig.3(c).  They  are  very  small  with  a  value  of  8.7/zA. 

3.  Applications 

The  down  literal  function  is  one  of  the  essential 
building  block  in  MVL,  and  most  MVL  functions  are 
combinations  of  kinds  of  down  literal  functions.  In  this 
section,  we  propose  quaternary  z/MOS  literal  and  T- 
gate  circuits  using  the  presented  z/MOS  down  literal 
circuits. 


Table  2.  Quaternary  literal 


VDD  =  3Vd 

Hi 

vb2 

Ha 

VM 

F1 

Q9 

2Vd 

Vd 

■e mm 

3H 

ESI 

vd 

0 

F2 

EB1 

mm 

mSm 

Figure  5.  Simulation  results  of  z/MOS  quater¬ 
nary  literal  circuit 


3.1  Quaternary  z/MOS  literal  circuits 


The  quaternary  literal  function  is  expressed  by 
Eq.(30)[l] 


f  1?  -  1  a  <  x  <b 
}  0  Others  , 


(30) 


where  x,a,b  €  {0, 1,2,3}.  The  literal  circuit  can  be 
composed  of  down  literal  circuits.  The  block  diagram 
is  shown  in  Fig.  4.  When  the  bias  voltages  of  this  cir¬ 
cuit  are  set  as  given  in  Table  2  according  to  quaternary 
i/MOS  down  literal  circuit  parameters,  the  quaternary 
literal  functions:  a;11,#12, x22  can  be  realized.  Their 
simulation  results  are  shown  in  Fig. 5.  Simulation  pa¬ 
rameters  are  the  same  as  those  used  in  the  quaternary 
z/MOS  down  literal  circuit. 


Vout 

(Xab) 


Figure  4.  z/MOS  quaternary  literal 


3.2  Quaternary  z/MOS  T-gate  circuit 

The  T-gate  is  a  functional  block  in  MVL.  It  is  of¬ 
ten  applied  in  MVL  image  processing,  such  as  qua¬ 
ternary  pattern  matching.  There  is  a  quaternary  T- 
gate  circuit  with  NMOS  proposed  in  [1].  It  requires 
multi-level  ion  implantation  technology  to  set  5  differ¬ 
ent  threshold  voltages  of  enhancement  and  depletion 
MOS  transistors,  which  increases  the  complexity  and 
the  fabrication  cost. 

Basing  on  quaternary  z/MOS  down  literal  circuit  and 


s 

*““GD”y  =x-0-Y 

T  j 


Vln 


Figure  6.  z/MOS  quaternary  T-gate 
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CMOS  pass  gates,  a  z/MOS  quaternary  T-gate  circuit 
is  built  to  complete  the  T-gate  function  denoted  in 
Eq.(31). 


T(p(hPi,P2,P3]x)=  { 


Po 

Pi 

P2 

P3 


x  =  0 
x  =  1 
x  =  2 
x  =  3 


(31) 


where  ®  €  {0, 1,2,3}.  The  circuit  is 

shown  in  Fig.6.  The  whole  circuit  can  be  fabricated 
only  using  a  standard  CMOS  process.  The  perfor¬ 
mance  of  the  proposed  T-gate  is  validated  by  the  sim¬ 
ulation  result  of  T-gate:  T(0, 1,2,3;  x)  shown  in  Fig.7. 
The  maximum  delay  time  of  this  z/MOS  T-gate  is 
7.95ns.  It  is  reduced  about  12.5  times  compared  with 
that  of  multi-ion  implantation  involved  CP-gate  quan¬ 
tizer  proposed  in  [6]. 


4  Conclusion 


In  this  paper,  the  voltage-mode  down  literal  circuit 
using  two  2-gate  z/MOS  transistors  has  been  proposed. 
Compared  to  the  conventional  circuit,  the  z/MOS  down 
literal  circuit  has  several  advantages.  It  can  be  fabri¬ 
cated  easily  by  general  CMOS  process,  without  multi¬ 
level  ion  implantation  necessary  in  the  conventional 
voltage-mode  MVL  circuits.  It  has  the  flexibility  that 
one  z/MOS  down  literal  circuit  can  be  used  to  real¬ 
ize  different  down  literal  functions  by  setting  bias  volt¬ 
ages.  The  proposed  down  literal  circuit  has  been  veri¬ 
fied  by  HSPICE  simulations  on  the  quaternary  z/MOS 
down  literal  circuit.  It  can  work  under  3V  with  8.7/zA 
peak  consumption  current  and  1.9  ns  delay  time,  and 
has  greater  noise  margin  and  switching  sensitivity  than 
C- z/MOS  inverter  proposed  by  Sliibata.  Basing  on 
the  quaternary  z/MOS  down  literal  circuits,  quaternary 
z/MOS  literal  and  T-gate  circuits  are  also  designed  and 
proved  by  simulations. 

With  the  proposed  circuits,  it  is  possible  to  fabri¬ 
cate  various  voltage-mode  MVL  circuits  by  the  stan¬ 
dard  CMOS  process,  and  reduce  the  cost.  In  addition, 
because  v PMOS  and  z/NMOS  in  the  proposed  down 
literal  circuit  have  their  own  floating  gates,  compared 
with  the  C-z/MOS  inverter  proposed  by  Shibata,  multi¬ 
valued  SRAM  can  be  designed  according  to  [7],  This 
work  will  be  done  in  the  further  research. 
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Abstract 

The  Overlap  Resolution  Number  System  (ORNS) 
employs  bit  level  analog  residue  arithmetic,  and  opens 
up  a  powerful  approach  to  digital  computing.  This  new 
redundant  representation  of  signals ,  with  Continuous 
Valued  Digits,  presents  new  methods  for  binary 
arithmetic  and  digital  signal  processing.  The  number 
system  is  based  on  analog  residue  digits,  as  opposed  to 
binary  or  multiple-valued  digit  levels.  Importantly ; 
arithmetic  in  ORNS  is  tolerant  to  VLSI  circuit 
tolerances.  This  allows  simple  elementary  analog 
circuits  to  be  employed,  targeting  digital  accuracy. 

1.  Introduction 

This  paper  presents  analog  CMOS  circuit  techniques 
for  elementary  ORNS  arithmetic  digit  manipulations. 
The  focus  lies  on  binary  multiplier  structures  that 
internally  employ  Continuous  Valued  Digits.  The 
principles  of  arithmetic  in  the  Overlap  Resolution 
Number  System  differ  significantly  from  the  familiar 
concepts  of  arithmetic  with  binary  or  multiple-valued 
digits:  logic  gates,  or  respectively  discrete  level  analog 
circuits,  are  replaced  by  continuous  valued  digit 
manipulation  circuits. 

Multiple- Valued  Logic  (MVL)  circuit  techniques 
exploit  the  potential  accuracy  of  silicon  circuits  by 
relying  on  more  than  just  the  two  states  on  and  off. 
Modem  circuits  for  MVL  arithmetic  units  successfully 
employ  analog  circuits  for  elementary  digit  operations, 
in  replacement  of  digital  logic  gates  [1]. 

This  paper  discusses  an  alternative  to  binary  and 
MVL  multiplier  structures.  In  the  following  section  we 
review  the  new  theory  of  Continuous  Valued  Digits 
(CVD’s),  and  we  compare  their  properties  with  the 
familiar  discrete  valued  digits  in  a  positional  number 
system  (PNS).  We  will  see  that  already  so  called  binary 
CVD’s  are  capable  of  exploiting  silicon  accuracy  in 


arithmetic  structures,  without  resorting  to  higher  radix 
values. 

Arithmetic  rules  in  ORNS  are  then  reviewed,  and 
their  impact  on  hardware  VLSI  architectures  is 
discussed  with  an  emphasis  on  the  implementation  of 
binary  digital  multiplication  functions.  The  paper  finally 
proceeds  with  introduction  of  analog  CMOS  circuits  for 
CVD’s.  These  circuits  comprise  the  leaf  cells  in 
hardware  architectures  for  arithmetic  building  blocks. 

2.  Radix-B  Overlap  Resolution 

Given  a  real  x  bound  by  X  as  W<X,  we  shall 
represent  it  by  a  set  of  CVD’s  rn ,  with  index  n-K...L 

and  K<L.  We  recall  from  [2]  and  [3]  that  there  exist 
two  methods  to  calculate  CVD’s,  each  arriving  at  the 
same  result.  The  first  method  involves  a  cascaded 
approach,  whereby  we  start  with  the  Most  Significant 

Digit  (MSD)  rL ,  and  compute  it  as  rL  =  B  ~  .  The 

positive  integer  B  is  the  radix,  and  we  shall  further 
assume  B>2 .  For  B  =  2  we  have  the  important  case  of 
binary  ORNS.  Further  digits  n<L  are  calculated  by  the 
cascade  rule: 

rn  =  (r„+l-an+l>B  0) 

with  an  =  [^J ,  whereby  an  is  an  integer  associated 
with  the  CVD  rn .  The  operator  L  J  denotes  flooring 
towards  zero,  such  that  [aj  <|rw| .  As  a  result  we  have 
| rn\<B  and  \an\<B-l .  The  CVD’s  rn  need  not  be  an 

integer.  We  choose  to  select  L  such  that  BL+l> \X\.  A 
rule  for  selecting  K  will  follow  in  the  next  section. 

The  second  method  involves  the  signed  modulo 
operation  amodB  =  a-B\_a/B],  which  we  define  for 
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Table  1:  ORNS  Example  for  *=58.742 


Decimal  (5=  10) 

Binary  ( B 

=2) 

n 

rn 

K 

?/(M) 

rn 

'an 

6 

- 

- 

1.17484 

1 

29.371 

5 

- 

- 

0.34968 

0 

8.742 

4 

- 

- 

0.69936 

0 

17.484 

3 

- 

- 

1.39672 

1 

34.918 

2 

0.58742 

0 

2.9371 

0.79744 

0 

19.936 

1 

5.87420 

5 

29.3710 

1.59488 

1 

39.872 

0 

8.74200 

8 

43.7100 

1.18796 

1 

29.699 

-1 

7.42000 

7 

37.1000 

0.37952 

0 

9.488 

Example  1:  A  value  x>0 ,  limited  by  X-  100 ,  shall 
be  represented  by  CVD’s  in  the  range  0 \iA  to  50jx<4 . 
We  select  two  radix  values,  2?=  10  for  decimal  ORNS,  and 

B  =  2  for  binary  ORNS.  To  satisfy  X<^+X  we  select 
L  -  1  for  the  decimal  case,  and  L  =  6  for  the  binary  case. 
For  both  we  select  K  =  -1 .  The  CVD’s  for  x  -  58.742  are 
presented  in  Table  1.  The  decimal  digit  r2  is  an  EED. 

We  observe,  that  an  are  the  PNS  digits  of  (xBL+lyX. 


integer  as  well  as  real  values  of  a  for  both  signs.  The 
ORNS  basic  expression  is: 

r»  =  (f  ■*i“"+1)nodS  (2) 

Both  methods  may  also  be  used  to  compute  digits 
with  index  n>L.  We  shall  see  in  following  sections  that 
such  excessively  evolved  digits  (EED’s)  serve  arithmetic 
with  CVD’s.  EED’s  are  equivalently  computed  by 
rn>L~rn~/&  conform  the  cascade  rule,  or  by 


^L—tt  +  X 

rnyL  =  d  *x/X  conform  the  basic  expression.  The 

proof  is  simple,  and  follows  from  properties  of  the 
modulo  operation.  We  conclude  that  CVD’s  in  ORNS 

are  of  the  general  form  (aif*)modR ,  with  real  a . 


Notice,  that  the  term  binary  refers  to  the  radix  B  =  2 , 
while  the  digits  rn  are  not  at  all  binary  valued. 

3.  Redundancy 


An  ORNS  number  is  written  as 
Nx  =  ( rL , . . ., . . .rK) ,  with  a  radix  ‘bar’  between 

r0  and  r_{ .  We  typically  use  a  decimal  notation  for  the 

value  of  a  CVD,  and  hence  a  vertical  bar  shall  be  used 
for  the  radix  point  of  the  ORNS  number.  In  this  paper 
we  shall  limit  our  discussions  to  non-negative  values  of 
x. 

A  remarkable  characteristic  of  a  CVD  is,  as  the  name 
implies,  that  a  digit  value  requires  a  form  of  continuous 
symbolization.  Hence,  we  may  also  term  CVD’s  as 
analog  digits ,  if  we  have  an  electronic  implementation 
in  mind.  If  a  linear  electronic  medium  of  our  choice,  for 
instance  a  current,  charge  or  voltage,  ranges  from  0  to 
±Q  units,  then  each  CVD  is  matched  proportionately  to 
an  electronic  quantity  qn  by  qn  =  rn  •  Q/B . 


The  value  x  shall  be  termed  the  root  of  the  number 
Nx  and  it  is  retrieved  from  Nx  by  the  MSD  alone, 

without  any  error:  x  =  r^X/B.  Nx  is  not  an 
approximate  representation,  since  rL  and  x  may 
assume  any  real  value.  However,  there  is  a  practical 
limitation.  With  an  increasing  precision  of  x,  it 
becomes  difficult,  if  not  eventually  impossible,  to 
maintain  that  precision  in  rL  when  a  circuit 

implementation  with  qL  is  envisioned.  We  therefore 
need  to  discuss  the  consequences  of  limited  precision 
digits  in  Nx . 

In  a  positional  number  system  the  integers  dn  are 

stored  as  multiple-valued  digits,  and  arithmetic  is 
performed  with  these  discrete  valued  digits,  ranging 
0...(£-l)  for  jc>0. 
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Table  2:  DVD  and  CVD  Error  Threshold  Examples 


Radix  B 

2 

4 

8 

10 

100 

9  DVD 

50.0% 

16.7% 

7.14% 

5.56% 

0.51% 

9  CVD 

16.7% 

10.0% 

5.56% 

4.55% 

0.50% 

Given  two  neighbouring  digits  rn  and  rn_  i ,  we  are 
able  to  retrieve  an  by  an  =  IrJ ,  and  recalculate  rn  as 
r n  -  on+t'n  i/B .  This  is  a  trivial  result  from  (1),  and 
equally  trivial  we  can  retrieve  an  by  an  =  rn-rn_{/B . 
Given  an  errored  digit  pair  r'n  and  t*n+\ ,  we  restore  r\ 
to  r"n  =  a'n+r'n_/B ,  whereby  we  calculate  the 
associated  integer  as  a'n  =  [r' n-r' n_^/B] ,  whereby  [.] 
denotes  rounding  to  the  nearest  integer.  With 

r'n  =  rn  +  £n  r'n-\~rn-\+Zn-\ »  we  find 

a'n  -  [an +en-en_  /B] .  Hence,  under  the  condition 

|<V2  (3) 

we  have  a'n  =  an .  Provided  that  a'n  -an,  we  find  that 
r"n~rn  +  z'n  contains  error  e'„  =  £rt- \/B  *  The 
success  of  obtaining  a'n  -  an  conveniently  does  not 
depend  on  the  value  of  rn .  We  show  in  [4]  that  the 
condition  in  (3)  implies 

\en\<B/(2(B+\))  (4) 

if  we  assume  equal  error  statistics  among  all  digits.  In 
comparison,  the  error  condition  for  Multiple- Valued 
digits  depends  on  the  digit  level  distances,  which  equals 
one  regardless  of  the  radix.  Hence,  the  error  of  discrete 
valued  digits  (DVD’s)  must  adhere  to  |ej  <  1/2 . 

The  important  relative  implementation  error  condition 
for  CVD’s  and  DVD’s  shall  be  defined  as  |eJ/»<  8  and 

various  values  of  &  are  reported  in  Table  2.  We 
recognize  the  familiar  noise  margin  of  50%  for  binary 
systems,  and  clearly  that  noise  tolerance  is  inferior  for 
CVD’s.  It  is  therefore  not  particularly  envisioned  that 
ORNS  be  employed  for  storage  of  x .  The  important 
gain  of  CVD’s  results  from  arithmetic  properties  which 
we  shall  now  discuss. 


4.  Addition  and  Multiplication 

The  CVD’s  rn(x+y)  of  a  sum  x+y  are  simply 

obtained  digit-wise  and  there  is  no  carry  or  other 
interaction  required  between  neighbouring  digits,  [3], 
[6]: 

rn(.x+y)  =  {rn(x)+rn(y))mo&B  (5) 

We  recall  that  we  have  limited  our  discussion  to 
x,y>0.  The  CVD’s  of  a  product  X  x  with  integer  X 
are: 

rn(Xx)  =  (X  •  r„(x))mod5  (6) 

In  the  special  case  X  =  Bk,  we  have 

r„(Bk-x )  =  rn_k(x) ,  and  if  y  =  ,  then 

v* 

r„(y  •*)  =  (!>*•  r„_k(x)ynodB  (7) 

This  is  the  equivalent  of  the  familiar  shift-and-add 
principle  for  Radix-B  multiplication.  In  binary  ORNS 
the  values  of  Xk  are  limited  to  {0, 1 } ,  and  hence  they 
serve  as  an  on-off  switch  for  summing  analog  voltages, 
currents  or  charges  rn_k(x) .  We  intend  to  represent  the 

multiplier  y  in  binary,  with  digits  Xk . 

The  rules  for  addition  and  multiplication  not  only 
hold  for  the  perfect  CVD’s  rn ,  but  also  for  imperfect 

values  r'n .  Of  course,  CVD  imperfections  of  operands 

in  addition  and  multiplication  propagate  into  the  result 
CVD’s,  but  it  has  been  demonstrated  here,  as  well  as  in 
[2]  and  [4],  that  ORNS  is  robust  against  such  errors. 

Example  2:  Given  x  =  31.89  and  y  =  3.54,  they 
shall  be  summed  in  decimal  ORNS  with  X  =  100 .  Their 
ORNS  numbers  with  approximate  CVD’s  are 
N*x  =  (3.2, 1.9|8.9)  and  N*y  =  (0.4,3.5|5.4) .  The  perfect 

sum  of  errored  digits  is  (3.6,5.4|4.3),  yet  we  shall 
consider  the  errored  sum  with  errored  digits 
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(9) 


W\c+J,  =  (3-61’5-39l4-31)-  We  find 

a'0  =  [5.39-4.31/10]  =  5  and 

r"0  =  5 +4.3 1/10  =  5.431,  and  further  a' ,  =  3  and 
r"[=  3.5431.  We  conclude,  with  a  minor  error,  that 

x+y  =  35.431 .  The  error  is  reduced  by  increasing  the 
number  of  digits. 

a 

5.  ORNS-Digital  Interface 

In  [5]  we  presented  a  method  for  generating  CVD’s 
directly  from  binary  bits,  and  we  discussed  an  ORNS 
based  architecture  for  a  binary  multiplier.  In  the 
remaining  sections  of  this  paper  we  will  employ  our 
combined  knowledge  of  the  rule  for  addition  and  the 
error  tolerance  of  CVD’s,  to  develop  CMOS  current¬ 
mode  analog  circuits  for  Radix-2  (binary)  ORNS. 

Considering  the  novelty  of  this  topic,  we  will  review 
the  generation  of  CVD’s  from  bits,  and  the  retrieval  of 
bits  from  CVD’s.  This  will  allow  us  to  use  the  arithmetic 
properties  of  CVD’s  within  a  binary  multiplier.  This 
encapsulation  of  ORNS  within  a  digital  cast  is  one 
application  of  the  number  system.  It  allows  the 
development  of  arithmetic  standard  cells  in  a  digital 
circuit  library. 

A  second  application  of  ORNS  lies  in  the  immediate 
representation  of  a  signal  value  x  by  CVD’s,  without 
the  intervention  of  a  binary  number  system.  In  essence 
this  is  an  alternative  to  quantization  and  digitalization  of 
a  signal,  whereby  we  are  still  able  to  effectively 
distribute  x  over  a  set  of  digits.  In  this  paper  are 
concerned  with  the  first  application,  in  particular  the 
development  of  a  binary  multiplier. 

A  number  x  is  represented  by  a  weighted  sum  over 
binary  bits  an 

v 

*=  Iv2*  (*> 

n=JC 


Considering  the  range  of  jc  in  Eqn.  (8),  we  select 

X  -  2l  + 1  =  2L + 1 .  We  require  L">K\  but  allow 
L\K<  0 .  Inserting  Eqn.  (8)  in  Eqn.  (2),  and  selecting 
B  =  2  we  find  the  CVD’s  of  x  directly  from  a  weighted 
sum  of  the  bits: 


n=K' 


With  a  modification  of  the  radix  in  Eqn.  (8)  we  can 
obtain  CVD’s  from  non-binary  PNS  digits  by  selecting 
B  accordingly.  We  shall  exploit  the  error  tolerance  of 
Eqn.  (4),  and  limit  the  scope  of  the  summation  in  Eqn. 
(9)  to  only  a  few  (»| /)  bits  per  CVD: 


L-j 


L'-j 

I 


2n+i~U 


n=L'—j—\\r+l 


(10) 


The  relative  digit  error  en  =  e/2  results  from  the 

truncation,  and  it  is  bound  by  [e,,!  <  2”^ .  The  bits  an  of 
x  equal  the  associated  integers  an .  A  typical  value  of 
\|/  =  4  implies  a  4-bit  digital-to-analog  (DA)  conversion 
per  CVD,  regardless  of  the  binary  word  length  of  x . 
Bits  are  calculated  from  CVD’s  by 


We  define  amod+B  =  a+FB  with  integer  /'  such 
that  0 <a+F-B<B  for  real  a .  Digit  r'n  is  now  restored 
by 

r"n  =  {^p-+~a'nyod+B  (12) 


We  obtain  a\  and  r"n  from  least  to  most  significant,  in 
sequence,  utilising  the  corrected  neighbour  r"n_ j .  Of 
course,  the  least  significant  digit  remains  uncorrected: 


6.  Binary  Multiplication 

In  Figure  1  we  present  a  4bit-by-4bit  multiplication 
example.  It  is  noted  that  the  applied  word-length  of  4 
bits  is  not  related  to  \|/  =  4.  There  are  seven  DA 
converters  in  the  top,  producing  digits  0<rn<2.  The 

four  rows  of  switches  are  driven  by  Xk  ‘s.  The  summers 

at  the  bottom  are  4-input  modulo  summers,  and  the 
triangles  are  symbols  for  the  digit  correction  and  bit 
retrieval  operations  of  Eqn.  (1 1)  and  Eqn.  (12).  Digit  r'n 
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Figure  1:  [1001]x[1001]  Multiplication  Example 

is  entered  at  the  top,  and  r"n_  1  at  the  right.  The  output 
r"n  is  obtained  from  the  bottom,  and  a'n  from  the  left. 

The  size  of  the  multiplier  has  been  selected  such,  that 
it  illustrates  the  principles  of  the  architecture.  The 
number  of  columns  can  easily  be  extended  to  v  bits, 
with  typical  values  of  16,  32  and  64.  A  v-by-v 
multiplier  of  such  dimensions  will  consist  of  p-bit 
layers,  each  v  wide.  The  value  of  p  is  determined  by 
the  analog  accuracy  of  the  employed  circuits.  In  our 
example  we  employed  4-bit  analog  accuracy  (p=4), 
and  hence  a  32-by-32  bit  multiplier  consists  of  eight  4- 
bit  layers. 


triangles  in  Figure  1,  and  in  essence  they  refresh  the 
digit  values.  The  DA  converters  are  only  needed  once. 
Their  word-length  is  determined  by  the  targeted  analog 
accuracy  It  is  only  required  to  provide  a  DA  resolution 
that  is  intended  to  be  supported  by  the  accuracy  of  the 
applied  circuits. 

7.  CMOS  Current-Mode  Analog  Circuits 

We  will  now  introduce  two  characteristic  circuits  for 
radix-2  (binary  ORNS)  analog  digits:  the  digit 
correction  circuit  that  performs  the  operations  of  Eqn. 
(11)  and  Eqn.  (12),  and  a  2-input  modulo  summer, 
which  is  a  building  block  for  a  4-input  summer.  In 
Figure  2  we  present  the  two  circuits.  Of  course, 
eventually  the  4-input  summer  is  best  designed  directly. 

The  left  circuit  performs  the  rule  for  addition  in  Eqn. 
(5).  Since  the  sum  within  the  modulo  operation  is 
limited  to  0<(rn(x)+rn(y))<2B ,  we  can  write 

r„(x+y)  =  rn(x)+rn(y)-KB  (13) 

whereby  ice  {0, 1} .  The  draining  currents  10  and  II 
represent  the  summands  rn(x)  and  rn{y) ,  and  the 

modulo  sum  rn(x+y)  is  drained  as  12.  The  voltage  at 
node  C  represents  k.  The  sum  rn(x)+rn(y)  at  M4  is 
compared  with  a  reference  current  at  M2,  representing 
the  digit  value  B .  We  recall  that  Q  corresponds  to  B . 


Layers  of  switches  are  separated  by  layers  of  The  comparison  results  in  ‘greater’  or  ‘less’,  leading 

summers  and  correction  units.  The  latter  are  depicted  by  to  C  ‘low’  or  ‘high’  respectively.  In  the  case  of  ‘less’, 


Figure  2:  CMOS  analog  current-mode  circuits  for  a  CVD  full  adder  (left)  and  CVD  correction 

with  binary  conversion  (right) 
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the  sum  is  simply  presented  at  the  output  by  M5  (k  =  0) . 
In  the  case  of  ‘greater’,  M6  is  turned  on,  and  the  value 
B  at  M3  is  subtracted  from  the  sum  (k  =  1) . 

The  right  circuit  in  Figure  2  performs  the  digit 
correction  and  delivers  a  binary  bit  for  each  CVD.  Digit 
r'n  is  drained  as  10,  and  r"n_{  as  II,  The  output  r"n  is 

drained  as  12.  The  corresponding  bit  a'n  is  provided  as  a 
voltage  VI. 

The  modulo  rounding  operation  in  Eqn.  (11)  is 
performed  by  comparing  the  difference  Vn—run_^/B  to 

the  digit  values  1/2  and  2/3  .  These  values  correspond 
to  quantities  0/ 4  and  30/4.  If  the  difference  lies 
between  the  two,  we  conclude  a\  =  1 ,  otherwise 
a'n  =  0 .  The  current  through  M4  equals  0/4 ,  and  the 

current  through  M6  equals  30/4 .  The  comparisons  are 
separately  evaluated  at  nodes  /  and  B ,  and  the  XOR 
delivers  the  bit  as  a  voltage  VI.  In  accordance  with  Eqn. 
(12),  for  a'n  =  1  the  cascade  of  switches  M5  and  M7 

adds  a  current  of  0/2  at  Ml 8  to  r"n_x/B  at  Ml 4. 

The  presented  circuits  have  been  designed  and 
simulated  in  a  0.8pm  CMOS  environment,  with  3.3V 
supply  voltage  and  0  =  50 .  The  4-by-4  multiplier  of 
Figure  1  has  been  successfully  designed,  based  on  these 
circuits.  The  optimization  for  speed  remains  a  topic  of 
future  research. 

The  reference  voltage  VO  in  both  circuits  may  be 
fixed,  or  adjusted  to  compensate  for  temperature 
variations.  The  accuracy  of  the  current  mirrors  chiefly 
determines  the  quality  of  the  circuit.  With  increased 
accuracy  the  accumulated  digit  error  is  reduced,  and  the 
column  size  in  a  multiplier  may  be  increased,  resulting 
in  fewer  layers.  Simpler  circuits  with  smaller  transistors 
introduce  more  error,  and  hence  the  architectural 
complexity  increases.  The  design  of  an  ORNS  based 
arithmetic  library  cell  is  therefore  directed  by  such 
trade-offs,  whereby  speed,  power  consumption,  area, 
and  switching  noise  are  components  in  the  cost  function. 

8.  Conclusions 

This  paper  has  discussed  the  role  of  analog  digits  in 
the  Overlap  Resolution  Number  System,  and  it  has 
reviewed  arithmetic  rules  and  interfaces  between 
multiple-valued  digits  and  continuous  valued  digits, 


with  a  particular  focus  on  the  binary  case.  We  have 
learned  that  a  binary  multiplier  of  any  dimension  can  be 
assembled  from  layers  of  switches  and  digit  refreshment 
circuits.  Analog  circuit  tolerance  is  not  a  prohibitive 
limitation  for  large  multipliers. 

ORNS  exploits  VLSI  circuit  accuracy  without 
resorting  to  a  radix  higher  than  B-2.  Nevertheless,  the 
presented  theories  allow  for  instance  quaternary  ORNS 
(B  =  4)  to  interface  with  4-level  MVL. 

CMOS  analog  current  mode  circuits  for  ORNS  are 
closely  related  to  current  mode  circuits  in  MVL.  They 
consist  of  familiar  structures  of  current  comparators  and 
current  mirrors. 
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Abstract 

Genetic  Algorithms  comprise  search  and  optimi¬ 
zation  strategies  which  are  inspired  by  natural  evolution: 
", survival  of  the  fittest ".  In  most  of  the  best  known  basic 
genetic  algorithms  a  binary  coding  of  solution  candidates 
is  used.  However  for  the  DNA-coding,  mother  nature  uses 
4  purine  bases:  adenine  (A),  cytosine  (C),  guanine  (G)  and 
thymine  (T).  Following  this  idea ,  the  present  paper  studies 
quaternary  coded  genetic  algorithms  and,  based  on  high 
complex  test  functions ,  shows  that  these  algorithms  have  a 
performance  as  good  as  their  corresponding  binary 
versions  for  problems  with  low  dimensions  and  reach  very 
fast  an  acceptable  good  fitness,  if  not  the  best,  for  high 
dimensional  problems.  For  the  first  time  it  is  shown,  that 
the  performance  of  genetic  algorithms  under  a  Gray  code 
is  sensitive  to  permutation  of  the  columns  of  the  code. 


1.  Introduction 

Genetic  Algorithms  were  introduced  by  John  Holland 
over  two  decades  ago  [Holl  75]  and  have  been  intensively 
studied  ever  since  (see  eg.  [Gold  89],  [Tane  89],  [Davi 
91],  [Eshe  91],  [Muhl  91],  [ScEs  93]). 

Following  the  original  idea  of  J.  Holland  most  genetic 
algorithms  use  a  binary  coding  of  their  individuals.  Real- 
coded  genetic  algorithms  have  been  developed  and  used 
with  success  (see  eg.  [Eshe  91],  [HeLM  98])  thus 
becoming  very  close  to  evolutionary  algorithms  [Rech  73], 
[Schw  75]  which  were  developed  for  real  parameter  opti¬ 
mization  and  consequently  work  always  with  real-valued 
individuals. 

One  of  the  particularly  most  representative  codings 
found  in  the  case  of  human  genetics  is  that  of  DNA  chains, 
which  uses  only  four  purine  bases  (A,  C,  G  and  T) 
between  a  two-stranded  winding  chain  of  deoxyribose  and 
phosphate,  thus  suggesting,  that  for  an  analog  purpose, 
nature  prefers  a  quaternary  code.  This  observation  motivat¬ 
ed  the  present  comparative  study  of  binary  and  quaternary 
coded  genetic  algorithms.  Nevertheless,  it  has  been  argued 
that  by  using  higher  cardinality  coding  alphabets,  fewer 
hyperplane  partitions  would  be  available  and  larger 
populations  would  be  necessary  (see  eg.  [Whit  93]).  The 


results  of  the  present  paper  show  that  this  is  not  the  case. 

The  paper  is  structured  as  follows.  In  the  next  section, 
the  basic  principles  of  genetic  algorithms  will  be  ex¬ 
plained.  This  will  be  followed  by  the  presentation  of  the 
different  aspects  to  be  considered  when  moving  from  a 
binary  to  a  quaternary  coding.  A  special  section  will  be 
devoted  to  introduce  the  test  functions  used  to  evaluate  the 
performance  of  the  algorithms.  The  paper  will  be  closed 
with  a  careful  analysis  of  the  test-results. 


2.  The  basic  genetic  algorithm 

Genetic  Algorithms  are  search  and  optimization 
procedures  inspired  by  biological  genetics  (selection, 
reproduction,  mutation)  which  exhibit  a  high  degree  of 
robustness  and  have  proven  to  offer  good  solutions  to  a 
number  of  relevant  hard  to  solve  problems. 

The  implementation  of  a  genetic  algorithm  comprises 
the  following  main  steps: 

1.  Random  initialization  of  a  properly  coded  population  of 
individuals  that  represent  candidates  for  the  solution  of  the 
given  problem. 

2.  Evaluation  of  the  goodness  of  each  individual  in  the 
population.  (Fitness  function) 

3.  Selection  of  the  best  individuals  with  a  probability 
proportional  to  their  relative  fitness.  Production  of  an 
intermediate  new  population  by  randomly  choosing  mating 
pairs  of  selected  individuals  and  applying  genetic 
operations:  crossover  (exchange  of  a  randomly  selected 
substring  between  two  individuals)  and  mutation  (change 
in  the  value  of  a  randomly  chosen  coding  position). 

4.  Evaluation  of  the  individuals  of  the  intermediate 
population. 

5.  Termination  of  the  algorithm  if  a  computing  time  limit 
or  a  satisfactory  solution  has  been  reached.  (Output:  the 
best  individual).  Otherwise  continuation  at  step  3. 

Quaternary  coded  genetic  algorithms 

It  becomes  apparent  that  the  structure  of  a  genetic 
algorithm  will  basically  not  change  if  a  quaternary  encod 
ing  is  used.  In  the  case  of  binary  coded  genetic  algorithms 
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the  most  frequently  used  encodings  are  the  position- 
weighted  and  the  Gray  code. 

It  will  be  first  analyzed  whether  besides  the  natural 
quaternary  coding,  obtained  from  the  coefficients  of  a 
polynomial  representation  of  integers  by  using  powers  of 
4,  a  quaternary  Gray  code  may  be  defined.  This  question 
may  be  positively  answered.  A  scheme  to  generate  a 
quaternary  Gray  code  is  shown  in  figure  1,  where  the 
Mfingers,,  have  to  be  properly  enlarged  in  the  case  of  a 
higher  number  of  variables.  It  becomes  apparent  that  it  is 
always  possible  to  construct  in  this  way  a  Hamilton-path  in 
(Z4)11.  Moreover  it  is  fairly  obvious  that  if  the  columns  of  a 
Gray  Code  are  permuted,  the  resulting  Code  is  also  Gray. 

With  respect  to  genetic  operations,  both  for  crossover 
and  selection  the  corresponding  operations  of  the  binary 
case  may  be  applied.  For  the  mutation  however,  the 
quaternary  case  offers  more  possibilities.  Three  mutation 
operations  will  be  discussed  below: 

Let  (an  l, a,,  a0 )  €  (Z4)11  represent  the  coding  of  an 

individual  and  let  aV  denote  a  mutated  value  at  the  i-th 
position.  The  following  mutation  strategies  will  be 
considered: 

1.  Random  mutation.  For  every  position  O^isn-1  and 


2.  Inversion  mutation.  Define  L  to  be  one  fourth  of  the 
length  n  of  an  individual.  Considering  the  mutation  proba¬ 
bility,  decide  whether  an  individual  should  be  mutated  or 
not.  If  yes,  generate  a  random  number  XeN,  L>A>0,  s.t.  its 
probability  distribution  is  inverse  proportional  to  the 
number  of  generations.  Moreover  choose  randomly  the 
mutation  start  position  s,  where  1  <s<n-k.  Finally  let  a’s+i 
=  as+Vi  for  all  i  s.t.  0<i<A.,  z'.e. starting  at  position  s  a 
substring  of  length  k  will  be  replaced  by  its  mirrored 
image.  (NB:  This  mutation  type  was  also  used  in  the  binary 
case.) 

3.  Delta  mutation.  This  mutation  type  is  based  on  the  non- 
uniform  mutation  introduced  by  Z.  Michalewicz  [Mich 
92].  For  every  position  0<i<n-l  and  taking  in  account  the 
predefined  mutation  probability,  ’’flip  a  coin"  to  decide 
whether  the  value  ^  should  be  mutated  or  not.  If  yes, 
generate  a  random  bit  be  {0, 1 }  and  let 

a’.  =  b[ai  +  A(g,  3-aj)]  +  (l-b)[a.  -  A(g,  a.)] 

where  g  denotes  the  number  of  the  present  generation  (the 
initial  population  represents  the  0-th  generation),  and  A  is 
defined  as 


taking  in  account  the  predefined  mutation  probability, 
’’flip  a  coin"  to  decide  whether  the  value  a;  should  be 
mutated  or  not.  If  yes,  generate  a  random  number  reZ4 
and  let  a’{  =  r.  Notice  that  if  r=  ai?  there  will  obviously  be 
no  change.  This  is  allowed. 


A:N0  x  {0,1, 2, 3}  {0, 1,2,3} 

A(g,  a)  =  a*  with  a*  €  {0, ...,  a} 

where  the  probability  of  producing  smaller  values 
increases  with  g. 


Fig.  1 :  Scheme  for  the  generation  of  quaternary  Gray  codes 
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3.  The  choice  of  test  functions  and 
algorithm  parameters 

In  order  to  evaluate  the  performance  of  genetic 
algorithms,  several  special  test  functions  have  been 
proposed.  For  this  paper,  the  test  functions  suggested  in 
[WRDM  96]  have  been  used,  since  they  have  been 
especially  developed  to  avoid  deficiencies  of  former  test 
functions.  Moreover  a  method  to  gradually  increase  their 
complexity  has  been  disclosed  in  [WRDM  96]. 

The  functions  are: 


Fl(x,y)  =  -  x  sinV|x-(y+47)|  - (y+47)  sin^  y+47+i 


F2(x,y)  =  x  sin(Y|y+l-x|)  cosVfy+l+xf  + 

+  (y+l)cos  (Y|y+l-x|)  sinVfy+l+xf 

with  x,y  e  [-512  ;  51 1] 

F3(x,y)  =  0.5  +  sjdmUZZ^Ll 
(l  +  O.OOl(x-y)2)2 

with  x,y  e  [-100  ;  100] 

The  following  method  to  gradually  increase  the 
complexity  of  the  test  functions  was  used: 

N-l 

EFa(x„...,  xN ) = ^  Fa(Xj,  xi+1 )  +  Fa(xN ,  x, ) 

i=l 

where  a  e  {1, 2, 3}  and  EF  stands  for  ’’extended  function*4. 
The  notation  EFaN  will  be  used  to  denote  an  extended  a- 
function  with  N  2-place  a- functions  as  components. 

In  order  to  adjust  the  basic  parameters  of  the  genetic 
algorithms  to  be  tested  (population  size,  probability  of 
crossover  and  mutation,  type  of  crossover,  mutation  and 
selection)  a  meta-genetic  algorithm  was  used  [Gref  86] 
upon  the  test  function  EF13  -(maximum  search)-  leading 
to  the  following  results  with  respect  to  the  Online- 
Performance  (average  of  the  fitness  of  all  individuals  by 
increasing  number  of  generations)  as  shown  in  Table  1  and 
Offline-Performance  (average  of  the  best  individual¬ 
fitness  through  the  past  generations)  as  shown  in  Table  2. 

When  adjusting  the  Parameters  of  the  Online-Perform¬ 
ance  a  2-point-crossover  and  a  tournament  selection  with 
elitism  (except  for  Gray-4)  was  used.  In  the  case  of  the 
Offline-Performance  a  1 -point-crossover  (except  for  the 
quaternary  Code:  2-p-c)  and  tournament  selection  with 
elitism  was  used  [Gold  89].  It  may  be  observed  that  the  ob¬ 


|  Tablel:  Parameters  for  Online-Performance 

Code 

Pop. 

Size 

Mutation 

type 

Binary 

380 

0.005 

0.50 

Invers. 

IsSSES 

360 

0.02 

0.55 

E— 

410 

0.001 

0.65 

370 

0.02 

0.45 

Invers. 

Table  2:  Parameters  for  the  Offline-Performance 

Code 

Pop. 

size 

Pmutation 

Pcross. 

Mutation 

type 

Binary 

390 

0.090 

0.65 

Random 

Gray-2 

410 

0.045 

0.85 

Compl. 

Quatem. 

380 

0.150 

1.00 

Delta 

Gray-4 

400 

0.150 

0.55 

Delta 

tained  population  size  is  much  larger  than  otherwise 
suggested  in  the  literature  (around  60);  but  this  very  much 
depends  on  the  degree  of  difficulty  posed  by  the  function 
(EF13)  selected  to  support  the  adjustment.  More  important 
is  to  notice  that  the  population  size  for  both  binary  and 
quaternary  coded  algorithms  is  very  similar.  It  should  also 
be  noticed  that  for  the  Online-Performance  the  inversion 
mutation  gave  the  best  results,  whereas  this  type  of 
mutation  is  hardly  mentioned  in  the  usual  literature. 


4.  Experimental  results 

The  most  important  results  of  this  study  may  be 
summarized  as  follows: 

(i)  In  the  case  of  problems  with  a  relatively  low 
complexity,  the  offline-performance  of  binary  as  well  as 
quaternary  coded  algorithms  are  very  similar,  as  shown  in 
figure  2  with  the  test  function  EF1 3. 

(ii)  The  pre-adjustment  of  the  algorithm  parameters  by 
means  of  a  meta  genetic  algorithm  contributes  to  a  notice¬ 
able  improvement  in  performance,  as  may  be  seen  in 
figure  3,  where  the  offline-  performance  of  optimized 
binary  and  quaternary  algorithms  processing  EF35  are 
compared  with  results  obtained  with  (binary  coded)  algo¬ 
rithms  using  the  set  of  parameters  suggested  by  K.  DeJong 
[DeJo  75]  and  J.  Grefenstette  [Gref  86],  It  should  be  notic¬ 
ed  that  the  quaternary  algorithms  achieve  their  highest 
fitness  with  only  50  generations,  meanwhile  the  straight 
binary  needs  over  600  generations.  The  Gray  coded  binary 
algorithm  needs  slightly  over  300  generations  to  reach  the 
same  level  of  performance,  but  reaches  later  a  roughly  5% 
better  performance. 

(iii)  Figure  4  shows  the  result  of  processing  EF1 10.  It  is 
simple  to  see  that  the  quaternary  algorithm  reaches  the 
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highest  fitness  much  faster  than  the  binary  and  binary  Gray 
genetic  algorithms. 

(iv)  The  performance  of  the  quaternary  algorithm 
depends  on  the  kind  of  mutation  that  is  used.  This  is 
clearly  disclosed  in  figure  5,  where  the  processing  of  the 
test  function  EF15  is  illustrated.  Depending  on  the  kind  of 
mutation  used,  a  difference  in  performance  of  up  to  10% 
may  be  observed. 

(v)  A  particularly  important  result  is  illustrated  in  figure 
6,  where  it  is  shown  that  (at  least  in  the  case  of  the  test 
function  EF35)  new  quaternary  Gray  codes,  obtained  by 
simply  permuting  the  columns  of  the  original  one  exhibit  a 
different  performance.  This  suggests  that  one  additional 
goal  of  the  meta  genetic  algorithm  should  be  the  search  for 
the  best  quaternary  Gray  code.  This  alone  is  a  hard 
problem,  since  the  search  space  is  huge:  every  single  Gray 
code  leads  to  (4°)^!)  possible  variations:  there  are  4° 
different  alternatives  to  code  Zero  and  there  are  n! 
different  permutations  of  the  columns  of  the  code. 


5.  Conclusions 

Quaternary  genetic  algorithms  have  been  introduced 
and  a  comparative  analysis  with  binary  algorithms  has 
been  done.  Particularly  difficult  test  functions  were 
selected  for  this  study  in  order  to  have  results  that  are 
significant  for  real-world-problems.  It  has  been  shown  that 
the  optimal  population  size  for  both  kinds  of  algorithms  is 
about  the  same  and  that  for  functions  of  low  dimension, 
binary  and  quaternary  algorithms  have  roughly  the  same 
performance.  In  the  case  of  higher  dimensional  functions, 
quaternary  algorithms  reach  a  high  level  of  fitness  much 
faster  than  the  corresponding  binary,  even  though  binary 
algorithms  will  reach  at  the  end  a  slightly  better  per¬ 
formance.  In  the  case  of  quaternary  algorithms  there  are 
different  kinds  of  mutation  and  the  choice  of  mutation 
affects  the  performance  of  the  algorithm.  Additionally,  in 
the  case  of  quaternary  Gray  coded  algorithms,  the  choice 
of  the  particular  Gray  code  also  affects  the  performance  of 
the  algorithm.  This  is  a  quite  new  result.  It  is  conjectured 
that  these  different  Gray  Codes  drive  the  exploration  and 
exploitation  activities  of  the  genetic  algorithm  into 
different  regions  of  the  problem  space  thus  affecting  the 
performance. 

The  results  of  this  study  suggest  that  in  the  case  of  high 
dimensional  problems  it  might  be  convenient  to  start  with 
a  quaternary  algorithm  and  by  the  time  the  fitness  remains 
asymptotic  stable,  change  to  a  binary  coded  one. 
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Note:  In  all  following  pictures,  the  vertical  axis  corresponds  to  the  relative  fitness  (function  maximum)  meanwhile  the  number 
of  generations  appears  in  the  horizontal  axis. 
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Figure  2:  Offline-Performance  of  EF13 
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Abstract 

This  paper  presents  a  class  of  complex  number  rep¬ 
resentations  called  Redundant  Complex  Number  Sys¬ 
tems  (RCNSs),  which  are  useful  for  designing  VLSI 
signal  processors  with  complex  arithmetic  capability. 
A  redundant  complex  number  system  is  defined  as  an 
imaginary-radix  number  system  having  a  redundant  in¬ 
teger  digit  set.  This  makes  possible  the  construction 
of  high-speed  complex  arithmetic  circuits:  examples  in¬ 
clude  a  complex-number  parallel  adder  with  no  carry 
propagation  chain ,  and  a  complex-number  multiplier 
using  fast  binary-tree  addition  structure.  This  paper 
also  presents  the  experimental  fabrication  of  the  RCNS- 
based  complex  multiplier  in  0.5pm  CMOS  technology. 


1.  Introduction 

Complex  arithmetic  computations  are  of  major  im¬ 
portance  for  various  signal  processing  and  scientific 
computation  algorithms.  Such  algorithms  include  com¬ 
plex  orthogonal  transformations,  convolutions,  corre¬ 
lations  and  complex  filtering.  These  applications  re¬ 
quire  efficient  representation  and  manipulation  of  com¬ 
plex  numbers  together  with  real  numbers.  To  simplify 
the  manipulation  of  complex  numbers,  several  authors 
have  proposed  special  number  representations  for  com¬ 
plex  numbers  [l]-[4].  However,  most  of  the  studies  fo¬ 
cused  on  the  theoretical  aspects  of  complex  number 
systems,  and  did  not  fully  discuss  the  practical  issues 
of  VLSI  circuit  design  and  application. 

Our  goal  is  to  develop  a  new  complex  number  repre¬ 
sentation  that  mates  possible  the  construction  of  high¬ 
speed  complex  arithmetic  circuits  which  can  be  in¬ 
stalled  into  practical  VLSI  signal  processors.  For  this 


purpose,  we  extend  the  concept  of  “imaginary  radix 
notation”  in  Knuth’s  quarter-imaginary  number  sys¬ 
tem  [5],  and  propose  a  class  of  complex  number  repre¬ 
sentations  called  Redundant  Complex  Number  Systems 
(RCNSs)  (see  [6]— [8]  for  earlier  discussions  on  this  top¬ 
ic1).  An  RCNS  is  a  radix-rj  system  with  digits  in 
{-a,  •  • ,  0,  *  •  • ,  a},  where  r(>  2)  is  an  integer,  j  de¬ 
notes  the  imaginary  unit  and  fr2/ 2]  <  a  <  r2  —  1.  The 
imaginary  radix  r  j  allows  unified  complex  number  rep¬ 
resentation  without  treating  real  and  imaginary  parts 
separately.  Also,  redundancy  in  number  representa¬ 
tion  allows  the  carry-propagation-free  addition  and  the 
binary-tree  multiple-operand  addition,  as  in  Avizienis’ 
Signed-Digit  (SD)  number  systems  [10]. 

This  paper  describes  efficient  designs  of  complex 
arithmetic  circuits  based  on  the  radix-2j  RCNSs:  ex¬ 
amples  include  a  complex-number  parallel  adder  with¬ 
out  carry  propagation,  and  a  complex-number  multi¬ 
plier  using  fast  binary-tree  multiple-operand  addition 
structure  together  with  an  operand  recoding  scheme  for 
reducing  complex-number  partial  products.  This  paper 
also  describes  the  result  of  implementing  the  proposed 
complex  multiplier  architecture  in  0.5^m  CMOS  tech¬ 
nology. 

2.  Redundant  Complex  Number  Systems 

2.1.  Definition 

A  redundant  complex  number  system  (RCNS)  is  de¬ 
fined  as  a  positional  number  system  that  has  an  imagi¬ 
nary  radix  rj,  where  r  is  an  integer  that  is  not  less  than 
2  and  j  denotes  the  imaginary  unit  (i.e.,  j2  =  -1). 
Each  digit  of  a  redundant  complex  number  can  assume 

xWe  presented  the  idea  of  redundant  complex  arithmetic  in 
1995.  Nielsen  and  Muller  also  suggested  similar  possibility  inde¬ 
pendently  of  our  works.  See  [9]  for  their  complex  adder  designs. 
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the  following  2a  +  1  values 


A*  =  {«,"•»  l>0,l,- ••»<*}»  (l) 

where  a  =  -a,  and  the  maximum  digit  magnitude  a 
must  be  within  the  following  range: 

y]<«<r2-l.  (2) 

The  notation  \x\  refers  to  the  least  integer  that  is  not 
less  than  the  real  number  x.  An  RCNS  with  radix  rj 
and  digit  set  Da  is  denoted  simply  by  RCNS  rj,  a. 
In  the  case  of  radix  2 j,  for  example,  we  can  consider 
the  two  distinct  systems:  RCNS  2j,  3  (with  the  digit 
set  D3  =  {3, 2,  T,  0,1, 2, 3})  and  RCNS  2j,2  (with  the 
digit  set  D2  =  {2,T,0, 1,2}).  The  algebraic  value  of  a 
number  X  =  (Xk-t  •  •  •  X0.X _i  •  •  •  X-t)  in  RCNS  rj ,  a 
notation  can  be  evaluated  as 


x  =  Xi(rjy.  (3) 

i——l 

Example  1  Given  an  explicit  value  1.25  —  1.5 j,  let 
us  consider  the  RCNS  2j,3  representation  of  the  value 
with  word  length  k  =  4  and  l  —  2.  There  are  twelve 
legitimate  RCNS  representations  as  follows: 


Sj 

4 

2  j 

1 

U  2 

1/4 

8  j 

4 

2  j 

1 

i/2 

1/4 

( 

0 

0 

1 

2 

1 

3  ) 

(0 

0 

0 

2 

.  3 

3  ) 

( 

1 

0 

3 

2 

1 

3  ) 

(0 

0 

1 

1 

.  1 

T  ) 

( 

0 

0 

0 

1 

3 

T  ) 

(1 

0 

3 

1 

.  1 

T  ) 

( 

0 

1 

1 

2  , 

1 

3  ) 

(0 

1 

0 

2 

.  3 

3  ) 

( 

1 

I 

3 

2  . 

1 

3  ) 

(0 

1 

1 

3 

.  1 

T  ) 

( 

0 

T 

0 

3  . 

3 

I  ) 

(1 

T 

3 

3 

.  I 

T  ) 

□ 

There  exists  a  close  relationship  between  RC- 
NSs  and  Signed-Digit  (SD)  number  systems.  That 
is,  if  we  decompose  the  radix-rjf  redundant  com¬ 
plex  number  representation  into  the  real  vector 
(X2[(k-1) /2J  *  *  *  X0  *  •  *  X„2[i/2\ )  and  the  imaginary  vec¬ 
tor  1)/2]— l  *  *  *  XiX-i  *  •  *  X -2\  if  2]+i)i  each  rep¬ 

resentation  can  be  regarded  as  a  special  Signed-Digit 
(SD)  number  representation  that  has  the  negative  radix 
— r2.  Note  here  that  the  notation  refers  to  the 
largest  integer  that  is  not  greater  than  the  real  number 
x.  Figure  1  summarizes  distinctive  features  of  RCNSs. 

In  this  paper,  we  focus  on  the  radix-2j  RCNSs,  i.e., 
RCNS  2j,3  and  RCNS  2j,2.  The  basic  arithmetic 
circuits  for  radix- 2 j  RCNSs  can  be  efficiently  imple¬ 
mented  with  the  state-of-the-art  redundant  binary  ar¬ 
chitecture  [11], [12]  as  discussed  in  Section  3.  Another 
interesting  choice  of  circuit  technology  is  current-mode 
CMOS  for  radix-4  arithmetic  [13]. 


The  addition  of  two  complex  numbers,  X  = 

andr  =  (y*-i---y*---y-i)  m 
RCNS  2j,3,  where  X^Yi  £  {3, 2, 1,0, 1,2,3},  is  per¬ 
formed  by  the  following  three  successive  steps  for  each 


digit: 

Step  1  Zi=Xi  +  Yu  (4) 

Step  2  -4 Ci  +  Ui  =  Zit  (5) 

Step  3  S{  =  Uj  +  Ci— 2,  (6) 

where  Z{  is  the  linear  sum  (of  X{  and  1{),  Ui  is  called 
the  intermediate  sum ,  and  Ci  is  the  carry .  The  ranges 
of  the  variables  are 

Xi,Yi  e  {3, 2,1,0, 1,2,3},  (7) 

Zi  6  {6,  *  ,0,  *  *  *  ,6},  (8) 

Ui  e  {2,  T,  0,1,2},  (9) 

Ci  e  {1,0,1},  (10) 

Si  e  {3, 2, T, 0,1, 2, 3}.  (11) 


The  digit  selection  rule  for  Step  2  can  be  written  as 


Ui  =  Zi-  4, 

Ci  =  1 

(if  Zi  >  2), 

Ui  = 

Ci  =  0 

(if  2  <  Zi  <  2), 

Ui  =  Zi  +  4, 

Ci  =  1 

(if  Zi  <  2). 

Note  that  there  are  two  possible  choices  for  the  val¬ 
ues  of  Ui  and  Ci ,  when  Zi  —  ±2.  The  actual  choice 
depends  on  the  circuit  implementation.  In  any  case, 
the  carry  output  Ci  is  determined  from  Zi  in  Step  2 
independently  of  the  other  carry  signals.  Thus  total¬ 
ly  parallel  addition  can  be  achieved  with  limited  carry 
propagation.  Figure  2  shows  an  example  of  RCNS  2j,  3 
addition.  This  type  of  carry-propagation-free  addition 
is  also  possible  in  RCNS  2j,2. 


X  =  3+4 j  Y=5  +  6y  S  =  X+Y  =  8  +  10j 


X  =  3  +  4 j  Y=5  +  6j  S  =  X+Y=8  +  10j 


weights  128)  64  32J  16  8]  4  2J  1 

X  11  3  3  2  T 

+)  Y  0  1  0  3  3  1 

Z  1  2  3  6  1  0 

U  A  A  A  A  -1  .0 


Ar  -—AT  -r-tr  —  AT  4T  A 

C  0  T  T  T  0  0 _ 

S  0  I  0  2  12  10 


Step  1 
Step  2 
Step  3 


weights  1261  64  32]  16  T)  4  2]  1 

X  01  II  11  11  10  01 

+>  y  oo  li  oo  li  li  oi 


z 

17 

C 


01  22  11  22  01  00 
Y )Yl T icr  LOr  )  o 


S  0  01  00  II  OI  10  01  00 


Step  V 
Step  2’ 
Step  3’ 


Figure  2.  RCNS  2j,3  addition. 


Figure  3.  RCNS  2j,3  addition  using  binary  SD  cod¬ 
ing. 


Table  1.  RCNS  2j,3  digit  representation. 


RCNS  X; 

3 

2 

T 

0 

1 

2 

3 

Binary  SD 

1  n 

Xi  Xi 

IT 

I  0 

o  T 

l  i 

0  0 

0  1 

1 1 

1  0 

1  1 

3.  Arithmetic  Circuits  Design 

3.1.  RCNS  2j,3  Adder  Using  Binary  SD  Coding 

As  it  will  become  apparent  later  on,  RCNS  2j,3  is 
useful  for  fast  complex-number  addition  because  of  its 
close  relationship  with  the  binary  SD  number  system2. 
This  section  describes  the  design  of  a  complex-number 
parallel  adder  circuit  using  binary  SD  coding  of  RCNS 
2j,3  digit.  On  the  other  hand,  RCNS  2j,2  is  partic¬ 
ularly  useful  for  multiplier  recoding  algorithm  as  de¬ 
scribed  in  Section  4. 

Let  X  =  ( Xk-i  -Xi  "  X-i)  be  a  complex  num¬ 
ber  in  RCNS  2j,3  notation.  When  RCNS  2j,3  arith¬ 
metic  circuits  are  constructed  with  binary  logic  ele¬ 
ments,  each  digit  X*  (G  {3, 2,  T,  0, 1, 2, 3})  must  be  cod¬ 
ed  into  a  vector  of  binary  bits.  We  introduce  here  bi¬ 
nary  SD  coding  of  the  RCNS  2 j,  3  digit.  In  this  coding 
scheme,  the  RCNS  2j,3  digit  X*  is  represented  by  a 
2-digit  binary  SD  code  xjxf  as  follows: 

Xi  =  2 x\  +  x°i  (^,^€{1,0,1}).  (12) 

Table  1  illustrates  this  coding.  Using  the  above  cod¬ 
ing,  RCNS  2j,3  addition  process  (4)-(6)  can  be  de¬ 
composed  as 

Step  1'  :  z\  =  x\  +  y\ ,  z?  =  x°i  +  27°.  (13) 

Step  2'  :  —2c-  +  u]  =  zj,  2 c°  +  u°  =  zf,  (14) 

Step  3'  :  sj  =  u\  +  c?,  s?  =  it?  +  c}_2.  (15) 

2 In  this  paper,  we  use  the  term  “binary  SD  number  system” 
instead  of  “redundant  binary  number  system”  in  order  to  avoid 
misinterpretation . 


The  ranges  of  the  variables  are  given  by  xf,yf  G 
{1,0,1},  zf  G  {2,1, 0,1, 2},  uf  G  {T, 0, 1},  c\  G 
{1,0,1},  and  sf  G  {1,0,1},  where  5  G  {0,1}.  In  Step 
2',  the  values  of  u\  and  c\  are  selected  in  order  that 
the  final  sum  sf  is  kept  within  the  range  of  {1,0, 1}. 
This  can  be  done  by  using  the  well-known  addition  al¬ 
gorithm  of  binary  SD  numbers  [11], [14]  with  a  slight 
modification  at  the  sign  of  upper  carry  digit  c\.  The 
values  of  uf  and  cf  are  selected  according  to  the  sign 
of  the  linear  sum  zf  (for  S  =  1)  or  z\_2  (for  S  =  0). 
Figure  3  shows  an  example  of  the  binary-SD-coded  R- 
CNS  2 j,  3  addition,  which  is  equivalent  to  the  example 
of  Figure  2.  The  calculation  for  a  single  digit  in  R- 
CNS  2j,3  addition  can  thus  be  performed  by  using 
a  pair  of  binary  SD  full  adders  (or  redundant  binary 
full  adders).  Figure  4  illustrates  the  RCNS  parallel 
adder  realization  using  binary  SD  full  adders.  Thus, 
the  well-known  binary  SD  architecture  [11], [12]  can  be 
employed  for  implementing  RCNS  2j,3  arithmetic. 

Figure  5  shows  the  proposed  configuration  of  RCNS 
2j,3  full  adder  circuit  using  a  pair  of  binary  SD  full 
adders.  We  have  derived  a  new  circuit  realization  of  a 
high-speed  binary  SD  full  adder  cell  by  transforming 
the  4-2  compressor  [15]  using  dual-rail  pass  transistor 
logic.  Using  the  proposed  RCNS  2 j,  3  full  adder  circuit, 
a  high-speed  complex-number  parallel  adder  with  no 
carry  propagation  chain  has  been  derived. 

3.2.  Vector  Notation  for  Algorithm  Description 

This  section  introduces  useful  notation  for  defin¬ 
ing  number  representation  systems,  which  is  an  ex¬ 
tension  of  Koren’s  notation  [14].  We  use  the  triplet 
(  D,  A,  W )  to  identify  a  positional  number  system  of 
k  +  l  digits  ( k  digits  in  the  integral  part  and  l  dig¬ 
its  in  the  fractional  parts),  where  D  is  the  digit  set , 
A  =  (Afc_i,  *  •  • ,  Ai,  *  •  • ,  A_/)  is  the  sign  vector ,  and 
W  -  (Wk- 1,—  ,  •  •  • ,  w-i)  is  the  absolute  weight 
vector ,  respectively.  The  algebraic  value  X  of  a  (A;-M)- 
digit  vector  (Xfc_i  •  •  •  X0.X_!  •  •  •  X_/)  in  the  system 
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RCNS  2j,3  FA:  RCNS  2], 3  Full  Adder 
Binary  SOFA  :  Binary  Signed-Digit  Full  Adder 
( Redundant  Binary  Full  Adder ) 

(b) 

Figure  4.  Realization  of  an  RCNS  parallel  adder  us¬ 
ing  binary  SDFAs. 


{  D,A,W)  is  given  by 

k- 1 

X  =  2  XiXiWi,  (16) 

i~-l 

where  Wi  is  the  absolute  value  of  the  weight  at  the  ith 
position,  and  A;  E  {1,1 

Using  the  triple  notation,  we  can  represent  R- 


CNS  2jf,3  of  4 

digits  (k  =  3,  l  = 

1)  by 

(  Drcns,ARcns, 

Wrcns  ),  where 

DrcnS  = 

{  3,  2,  T,  0,  1,  2,  3  }, 

A  RCNS  — 

(  i,  ii  1.  i  ). 

(17) 

Wrcns  = 

(  22,  21,  2°,  2-1  ). 

As  mentioned  in  the  last  section,  the  circuit  implemen¬ 
tation  of  RCNS  2j,3  is  based  on  the  decomposition 
of  RCNS  2jf,3  into  binary  SD  number  representation. 
By  decomposing  every  digit  of  the  above  triple  nota¬ 
tion  into  2-digit  binary  SD  code,  we  have  the  system 
(  Di,Ai,Wj  ),  where 

£>/  =  {  T,  0,  1  }, 

A /  =  (  1,  1,  3,  h  1.  1,  3,  3  )>  (18) 

m  =  (  23,  22,  22,  21,  21,  2°,  2°,  2-1  )• 


Xiu  y\+  yf  x?*  &  #  y? 


Figure  5.  RCNS  2j,3  full  adder:  (a)  logical  circuit 
diagram,  (b)  circuit  details  of  the  binary  SDFA  cell. 

4.  High-Speed  Complex  Multiplier 

4.1.  Complex  Multiplication  Algorithm 

This  section  is  to  design  a  complex-number  multi¬ 
plier,  that  can  be  installed  into  practical  VLSI  signal 
processors.  The  input  /output  numbers  of  this  multipli¬ 
er  are  assumed  to  be  in  the  merged  two’s  complement 
binary  form  (  J3m2ci )•  (To  obtain  this 
representation,  we  first  represent  the  real  part  and  the 
imaginary  part  of  the  input  operand  by  two’s  comple¬ 
ment  number  system,  and  merge  the  real  and  imag¬ 
inary  strings  into  one  string.)  We  also  assume  that 
both  the  multiplier  X  and  the  multiplicand  Y  are  giv¬ 
en  with  the  precision  of  (n,n)  bits3.  Figure  6  shows 
an  example  of  the  total  process  of  (4,4)  x  (4,4)-bit 
complex-number  multiplication. 

The  (n,n)-bit  multiplicand  X  is  first  given  in  the 


The  complex-number  parallel  adder  of  Figure  4  (b.)  ex¬ 
ecutes  the  operation  in  this  number  system. 
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3 The  symbol  (ni,ri2)  represents  the  precision  of  complex 
numbers,  where  n\  is  for  real  part  and  n<i  is  for  imaginary  part. 


X=5  +  2 / 


y=7+3j 


P  =  X  •  y  =  29  +  29,/ 


(Multiplicand)  X  =  5  +  2j 

Sign  A  m2c  7  T  1  j  j  1  1  j 

Weight  23  2 3  22  22  21  2 1  2°  2<> 

(D^aA^oW^j  0  0  10  10  10 

Jpo  yl  S»0  crl  JffO  crl  »f0  Wl 

*3  *2  *2  *1  *1  *0  *0  -1 

Sign  Modification  ^ 

Sign  Aj  J  T  T  y  /  1  1  J 

Weight  23  23  g2  ^2  gi  gi  jQ  2 0 

<DIiAI1ir1>  0  0  10X01  0 

fO  rt  vO  vl  r°  r1  r°  r* 

*3  *2  2  *1  *1  *0  -1 


(Multiplier)  Y  =  7  +  3j 

Sign  Am2c  T  1  J  j  1  1  j  j 

Weight  23  22  22  2*  21  2°  2 0  2 

<£U.A 0  1  0  1  1110 

V®  vl  v®  v  1  V®  v  1  V® 

?2  *M>  ‘'o  «M  y.l  /Av 

Radix-4  Booth 

Sign  1  j  1  j 

Weight  22  21  2°  2‘\ 

2  2  T  2 

Sign  Modification  ♦  ♦  ♦  ♦ 

Sign  T  j  1  J 

Weight  22  2 1  29  2A 

1  2  I  2 

RCNS2j,2  ^  Ti  y#  yi 


Partial  Products 


1st  Level  Addition 

*1+^0  * 
W 


P1+P2  z 

w 

c 

2nd  Level  Addition 

Sq+  Sj  z 

w 

c 

p 


Sign  A!  7  T  T  j  j  1  1  1JTT  j  j  1  1  7  7  T 

Weight  27  27  2e  26  2s  25  24  24  23  2 3  22  22 21  2^  20  2°  2*1  2*1 


0  0  0  1  0  1  0  1  0  0 
0  0  0  0  lot  010 

0  0  0  1  0  1  0  1  0  0 

0  0  0  1  0  1  0  I  0  0 

0  0  0  0  1  T  t  1  T  1  0  0 

o  0 


<To'  (To  i  o  o'TTT  (To' 

0  0  0  0  1  0  1  0  0  0  1  1  0  0 

0  0  0  1  0  2  0  0  0  1  0  0 

PJO  p  J  p  0  0  0  PJ  0  J) 


P-1 


Pi 

P2 


0  0  TO  10  0  OTTO  0 
0  0T01T00T0  0  TO  0 

0  0  I  0  1  I  0  0  0  0  1  I  0  0  i  to  0 
0  0  1  0  1  1  0  0  0 .0  1  3  0.0  11  .0  p 


Si 


Final  Product 


0  010101  0  0  o'  0  0'  0  0  0 

0  0  10  17  0  0  0  0  1  I  0  0  1  1  0  0  18  digits 

p?  pJ  p?  pJ  pg  Pi  p$  p§  Pi  4  pJ  Pi  pJ  p»  PiP?!  Pi 


Number  System  Conversion  | 

Sign  Am2c  j  T  1  j  j  1  1  j  j  1  1  j  j  1  *  j  j  1 

Weight  Wm2c  27  27  26  26  25  25  24  24  23  23  22  22  21  21  2°  2°  2-1  2'1 

0  0  0  0  0  0  1  1  1  1  1  1  0  0  1  1  0  0 

Figure  6.  An  example  of  (4,4)  x  (4,4)-bit  complex-number  multiplication. 


form  of  (  Dm2ci  Am2a^^m2c  )  &S 


Am2c 

J 

1 

1 

j 

■*  1 

1 

3 

Wm2c 

2n_1 

2n-i 

2 71-2 

2n~2  ■ 

21 

2° 

2° 

*n-l 

"T1 - 

Xn-2 

Xn-2 

*n-3  ■ 

’  •  *0 

Xq 

~  i 

X_i  , 

(19) 


where  each  digit  is  in  Dm 2c(=  {0, 1})  and  we  assume 
n  is  even  without  loss  of  generality.  This  multiplicand 
is  converted  into  (  Wj  )  expression: 


A  / 
Wi 

jn-i 

2n 

J-1  • 

2n~i  . 

•*  3* 

.  •  25+1 

j3  * 
2s  • 

•  *  3° 

21 

j°  j  1  j  1 
2°  2°  2-1 

u 

Xn  —  1  * 

xlA 

^5  • 

Xq 

Xq  xil 

£q  Xq  1 

(20) 


On  the  other  hand,  the  (n,n)-bit  multiplier  Y  in 
(  Dm2c,  Am2c,  Wm 2c )  is  converted  into  the  RCNS  2 j,  2 


representation  in  order  to  reduce  the  number  of  partial 
products.  This  multiplier  recoding  can  be  done  by  us¬ 
ing  the  well-known  radix-4  modified  Booth’s  algorith- 
m  [14]  as  follows:  Let  the  (n,  n)-bit  multiplier  Y  in 
(  Dm2aAm 2c,  Wm2c  )  be  given  by 


Am2c 

Wm2c 

to 

l^1 

1 

2n-2 

3 

2n— 2 

i 

2n“3  • 

■  •  i 

••  21 

1  3 

2°  2° 

3 

2-1 

Vn-2 

yil-2 

j/i-3 

CO 

I 

■  *  2/o 

Vo  D-i 

2/-1  ! 

(21) 


where  we  assume  that  the  range  of  the  n-bit  imaginary 
part  is  shifted  one  bit  relative  to  that  of  the  real  part 
in  order  to  simplify  the  algorithm  description.  Thus, 
the  least  significant  imaginary  digit  has  the  weight  of 
2-1.  (One  can  easily  adjust  the  ranges  by  introduc- 
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ing  a  dummy  digit  at  the  most  significant  position.) 
Recoding  the  multiplier  with  the  equation: 


Yt  =  -2  y]  +  ft+  2, 


(22) 


where  t  =  -1,0, 1,2,  •  •  *  ,n  — 
can  rewrite  the  multiplier  in 

2  and  yl2 
the  form: 

=  yl-  3 

=  0,  we 

Sign 

Weight 

1  3  ■ 

e^n—2  2  n— 3 

-  j  1 
•  •  21  2° 

3 

2-1 

CO 

t 

$? 

N 

1 

y 

••  Y[  Yl 

,  (23) 

where  Y(  €  {2, 1,0, 1,2}.  Modifying  the  sign  vector, 

Sign 

Weight 

jn-2  jn-3  . 

2  n  —  2  2 

■  ■  3 1  3° 

■■  21  2° 

r1 

2-1 

>  (24) 

CO 

1 

CM 

l 

••  Ti  To 

Y-i 

where  Yt  €  {2,T,  0,1,2}.  This  is  the  fixed  radix  num¬ 
ber  system  corresponding  to  RCNS  2j,  2.  Thus,  the 
multiplier  Y  is  recoded  into  the  RCNS  2j ,  2  vector 
(Yn- 2  ■Yf-  Y- 1)  given  by 


n—2 


Y  =  J2  Yt  X  (2 i)4. 

t=- 1 


(25) 


The  multiplicand  X  given  by  (20)  and  the  recoded 
multiplier  Y  given  by  (25)  generate  n  partial  products 
P n— 2>  '  *  ‘  i  -P—1 

Pt  =  X  ■  Yt  x  (2 j)*.  (26) 

Note  that  X  x  (2j)t  corresponds  to  the  2i-digit  shift  of 
the  vector  X  in  the  (  Di,Aj,Wj  )  notation,  since  this 
system  is  “decomposed”  version  of  RCNS  2j,3.  Hence, 
Pt  can  be  represented  in  (  Dj,  A/,  Wj  )  as 


A  j 

■n+t  -n+t  .  . 

f+* 

js+t  ■  ■ 

■■  3* 

jt  rl+t  r  l+t 

Wi 

2^4'i+i  2n+* 

2*+i+* 

2a+t  ■  ■ 

■ .  21+t  2*  2‘  2-1+t 

pit  ■■ 

Pst 

u 

Pst 

••  Pot 

Pot  pin  • 

(27) 


where  p8st  G  {1,0,1}  and  t  =  —  l,-*-,n  —  2.  Each 
digit  of  the  partial  products  is  calculated  by  shift  and 
complement  operations  according  to  the  value  of  Yt  as 


if 

Yt  =  2, 

then 

p\t  =  P°st  =  Z*-2. 

if 

Yt  =  1, 

then 

Pit  =  P°st  = 

if 

Yt  =  0, 

then 

Plt=$t=  o,  _ 

if 

Yt  =  1, 

then 

Pit  =  Z},  P°st  =  X°s’ 

if 

Yt  =  2, 

then 

Pit  =  X°s>  P°st  =  **-2- 

These  partial  products  Pn_ 2,  •  •  • ,  P~i  are  in  the  for- 
m  of  (  £)/,  A/,  Wj  ).  The  next  step  is  to  accumulate  all 
these  partial  products  to  obtain  a  single  product  P  us¬ 
ing  RCNS  parallel  adder  shown  in  Figure  4  (b).  Since 


the  RCNS  parallel  adder  is  regarded  as  a  2-to-l  reducer, 
a  highly  regular  binary-tree  structure  of  [log2  n]  levels 
can  be  employed  for  high-speed  accumulation  of  the  n 
partial  products.  The  result  of  the  adder  tree  is  the 
(4n  +  2)-digit  product  P  in  (  Dj,  Aj,  Wi  )  and  conse¬ 
quently  a  conversion  to  the  merged  two’s  complement 
representation  (  Dm 2C,  Am 2c,  Wm 2C )  is  needed,  in  most 
cases.  This  conversion  can  be  performed  by  subtract¬ 
ing  the  digits  having  negative  sign  elements  from  the 
digits  having  positive  sign  elements  for  real/imaginary 
components  independently.  (This  conversion  process  is 
similar  to  the  well-known  SD-to- binary  conversion.)  In 
the  case 'of  (n,  n)  x  (n,n)-bit  multiplier,  the  conversion 
can  be  performed  by  employing  a  pair  of  (2 n  +  l)-bit 
carry  propagate  adders  for  the  real  and  imaginary  parts 
independently. 

4.2.  Implementation  and  Evaluation 

The  block  diagram  of  the  proposed  complex  mul¬ 
tiplier  is  illustrated  in  Figure  7  for  (4,4)  x  (4,4)-bit 
multiplication.  Using  the  RCNS  representation,  we 
can  construct  a  complex-number  multiplier  exhibiting 
a  highly  regular  topological  structure  that  resembles 
those  of  real-number  multipliers.  Table  2  compares 
real-number  multipliers  using  the  binary  SD  number 
representation  and  complex-number  multipliers  using 
the  RCNS  representation  in  terms  of  hardware  com¬ 
plexity  and  the  number  of  adder  tree  stages.  In  gen¬ 
eral,  the  (n,  n)  x  (n,  n)-bit  complex-number  multiplier 
exhibits  the  same  level  of  circuit  complexity  as  that  of 
the  2n  x  2n-bit  real-number  multiplier. 

In  order  to  confirm  the  proposed  principle,  an 
(8, 8)  x  (8, 8)-bit  redundant  complex  multiplier  has  been 
designed  and  implemented  in  0.5  pm  standard  CMOS 
technology.  Figure  8  shows  the  chip  micrograph  and  its 
main  features.  The  chip  integrates  15,340  transistors  in 
an  active  area  5.98  mm2.  The  computation  time  could 
be  further  improved  to  the  same  level  with  state-of-the- 
art  real-number  multipliers  in  principle,  since  there  is 
no  essential  difference  between  the  proposed  multiplier 
configuration  and  the  conventional  real-number  multi¬ 
pliers  using  binary  SD  (redundant  binary)  adders  [12] 
or  4-2  compressors  [15]. 

When  a  complex  multiplier  is  implemented  on  the 
basis  of  conventional  real-number  arithmetic,  four  mul¬ 
tipliers  and  two  adders  must  be  connected  with  compli¬ 
cated  global  interconnections.  The  use  of  RCNS  makes 
possible  the  construction  of  a  regular  complex  multi¬ 
plier  without  using  such  global  interconnections.  We 
have  compared  the  proposed  design  with  the  complex- 
number  multiplier  consisting  of  four  real-number  multi¬ 
pliers  and  two  real-number  adders  using  redundant  bi- 
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C  Multiplicand  X  ) 
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Sign  Converter 

-Po 

||]  Booth  Encoder 

*So 

pi  Partial  Product 
^  Generator 

-Pi 

|"1  Binary  SDFA 

-sx 

*-p 

RCNS-to-Binary  Converter 

1  I  I  I  I  I  (  Final  Product^  P  )1  I  I  1  I  I  I 

Figure  7.  (4,4)  x  (4,4)-bit  complex-number  multiplier. 


Supply  Voltage:  3.3  V 

Execution  Time:  12.9  ns 

Active  Area  Size: 

1.87  mm  X  3.20  mm 
(5.98  mm2) 

Transistor  Count: 

15,340  (except  I/O) 

Technology: 

VDEC  0.5  um  CMOS 
Double  Metal 


Figure  8.  (8,8)  x  (8, 8)-bit  complex-number  multiplier  chip. 


nary  arithmetic.  It  is  estimated  that  the  RCNS-based 
design  achieves  21%  reduction  in  chip  area  for  (8,8)- 
bit  precision,  and  27%  reduction  for  (32, 32)-bit  preci¬ 
sion.  Also,  the  elimination  of  global  interconnections 
makes  possible  significant  reduction  in  wiring  capaci¬ 
tance,  which  leads  to  high-speed  low-power  operation. 


us  to  design  low-overhead  real/complex  reconfigurable 
arithmetic  circuits  [8].  As  an  example,  we  have  also 
developed  a  real/complex  reconfigurable  arithmetic  u- 
nit,  which  can  change  its  structure  for  three  different 
arithmetic  modes  in  real  time.  The  detailed  result  will 
be  reported  in  future  papers. 


5.  Conclusions 

This  paper  presents  the  redundant  complex  arith¬ 
metic  for  VLSI  signal  processing,  and  its  application 
to  complex  multiplier  design.  Another  unique  property 
of  the  redundant  complex  arithmetic  is  its  compatibili¬ 
ty  with  the  Signed-Digit  (SD)  arithmetic,  which  allows 
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Table  2.  Comparison  of  real-number  multipliers  and  complex-number  multipliers. 


Operand 

Wordlength 

(bit) 

Number  of  Modules* 

Number  of 
Transistors** 

Number  of 
Inter-Module 
Interconnects 

Number  of 
Adder  Tree 
Levels 

BE 

PPG 

SDFA 

CPA 

8x8 

4 

36 

30 

16  bit  X  1 

3862 

428 

2 

Real- 

16  X  16 

8 

136 

132 

32  bit  x  1 

14198 

1688 

3 

Number 

32  X  32 

16 

528 

537 

64  bit  X  1 

53382 

6588 

4 

Multipliers 

64  x  64 

32 

2080 

2133 

128  bit  x  1 

203558 

25752 

5 

128  X  128 

64 

8256 

8432 

256  bit  X  1 

788774 

101228 

6 

(4,4)x(4,4) 

4 

40 

34 

9  bit  x  2 

4328 

474 

2 

Complex- 

(8,8) x (8,8) 

8 

144 

142 

17  bit  X  2 

15340 

1794 

3 

Number 

(16,16)x(16,16) 

16 

544 

560 

33  bit  x  2 

55500 

6826 

4 

Multipliers 

(32,32)  x  (32,32) 

32 

2112 

2184 

65  bit  x  2 

208300 

26274 

5 

(64,64)  x  (64,64) 

64 

8320 

8542 

129  bit  X  2 

798956 

102346 

6 

*BE:  Booth  Encoder,  PPG:  Partial  Product  Generator,  SDFA:  Binary  SD  Full  Adder, 
CPA:  Carry  Propagate  Adder  (for  Number  System  Conversion) 


**The  designs  are  based  on  standard  CMOS  technology. 
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On  the  Number  of  Multilinear  Partitions  and  the  Computing 
Capacity  of  Multiple- Valued  Multiple-Threshold  Perceptrons 


Alioune  Ngom  *  Ivan  Stojmenovic  t  Jovisa  Zunic  * 


Abstract 

We  introduce  the  concept  of  multilinear  partition  of 
a  point  set  V  C  Rn  and  the  concept  of  multilinear 
separability  of  a  function  f  :  V  h+  K  —  {0, . . . ,  k  - 
1}.  Based  on  well  known  relationships  between  lin¬ 
ear  partitions  and  minimal  pairs ,  we  derive  formulae 
for  the  number  of  multilinear  partitions  of  a  point  set 
in  general  position  and  of  the  set  K 2.  The  (n,k,s)- 
perceptrons  partition  the  input  space  V  into  5  +  1  re¬ 
gions  with  s  parallel  hyperplanes.  We  obtain  results  on 
the  capacity  of  a  single  (n,  k,  s) -perceptron,  respectively 
for  V  C  Rn  in  general  position  and  for  V  =  K 2 .  Fi¬ 
nally,  we  describe  a  fast  polynomial-time  algorithm  for 
counting  the  multilinear  partitions  of  K 2 . 


1  Introduction 

Let  V  C  Rn  and  K  =  {0, . . . ,  A;  —  1},  with  k  >  2.  A 
n-input  k-valued  real  function  f  :  V  h+  K  is  a  function 
with  real-valued  inputs  and  /.-valued  outputs.  When 
V  C  Kn,  we  will  refer  to  /  as  a  n-input  k-valued  logic 
function.  We  denote  by  P£  the  set  of  all  n-input  k- 
valued  real  (resp.  logic)  functions  /  :  V  K  and 
by  Pk  =  Un>i  set  ^-valued  real  (logic) 

functions.  For  instance,  for  k  =  2  P2  is  the  set  of  all 
binary  logic  functions. 

A  n-input  k- valued  5-threshold  perceptron  [3],  ab¬ 
breviated  as  (n,  k ,  s) -perceptron,  computes  a  n-input  k- 
valued  weighted  s -threshold  function  F£a(w,t,c>)  given 
by 

*  Computer  Science  Department,  Lakehead  University,  855 
Oliver  Road,  Thunder  Bay,  Ontario  P7B  5E1,  Canada,  an- 
gom@ice.lakeheadu.ca.  Research  is  partially  supported  by 
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t Computer  Science  Department,  School  of  Information  Tech¬ 
nology  and  Engineering,  University  of  Ottawa,  150  Louis  Pas¬ 
teur,  Ottawa,  Ontario  KIN  9B4,  Canada,  ivan@csi.uottawa.ca. 
Research  is  partially  supported  by  NSERC 

^Institute  of  Applied  Basic  Disciplines,  Faculty  of  Engineer¬ 
ing,  University  of  Novi  Sad,  Veljka  Vlahovtfa  3,  21000  Novi  Sad, 
Yugoslavia 


{oo  if  wx  <  ti 

Oi  if  ti  <wx  <  ti+i  1  <  i  <  s  -  1 
Os  if  ts  <  wx 

where  input  vector  x  =  (xi,...,xn)  e  V,  weight 
vector  w  =  (uii, . . .  ,wn)  €  V,  threshold  vector  t  = 
(ti,...,t5)  e  R8  and  U  <  ti+1  (1  <  i  <  s  -  1  and 
1  <  s  ^  hn  -  1,  the  number  of  threshold  values),  and 
output  vector  o  =  (o0,...,os)  €  K8+1 .  A  ( n,k,s )- 
perceptron  simulates  a  /?- valued  function  f  :  V  ■-+  K. 
Depending  on  V  we  will  either  refer  to  real  or  logic 
(n,  k ,  5)-perceptrons.  It  is  well  known  that  any  n-input 
fc-valued  logic  function  can  be  transformed  into  a  k- 
valued  s-threshold  function  [1]  for  some  s. 

Multiple-threshold  devices  [1]  are  threshold  elements 
containing  multiple  levels  of  excitation  (thresholds). 
They  have  drawn  less  enthousiasm.  Among  there  qual¬ 
ities,  though,  is  that  given  enough  thresholds,  a  single 
multiple-threshold  element  can  realize  any  given  func¬ 
tion  operating  on  a  finite  domain. 

The  (n,  k,  k  —  1) -perceptrons  were  introduced  by  [4] 
as  model  for  simulating  continuous  perceptrons  with 
limited  precision  on  their  inputs.  Learning  algorithms 
for  some  classes  of  (n,k,  5) -perceptrons  are  investigated 
in  [3].  [7]  introduced  the  ( n ,  2,  s) -perceptrons  and  their 
computational  power  have  been  intensively  studied  in 
[5], 

In  this  research  paper  we  propose  to  compute  the 
capacity  of  a  (n,  k,  s)-perceptron,  that  is,  the  number 
of  A;- valued  functions  /  :V  K  that  can  be  simulated 
by  a  single  (n,  k ,  s)-perceptron. 

There  has  been  an  intensive  interest  in  threshold 
logic  as  the  main  component  of  neural  network  mod¬ 
els.  These  models  provide  a  direction  for  pattern  recog¬ 
nition  systems  with  distinct  natural  advantages.  The 
capacity  of  these  models,  as  well  as  their  computing 
power,  are  directly  related  to  the  number  of  thresh¬ 
old  functions.  The  ability  of  multiple-threshold  devices 
to  simulate  a  larger  number  of  functions  compared  to 
single-threshold  devices  is  vital  for  the  capacity  and  ca¬ 
pabilities  of  neural  network  models  based  on  threshold 
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logic.  It  is  therefore  of  practical  as  well  as  theoretical 
interest  to  estimate  the  number  of  functions  that  can 
be  modeled  as  multiple-threshold  functions  for  a  given 
number  of  inputs  and  threshold  levels. 

2  Multilinear  separability  and  multilin¬ 
ear  partition 

The  problem  of  computing  (or  simulating)  a  given 
function  /  by  a  (n,  k ,  s)-perceptron,  is  to  determine  s 
and  a  vector  r  =  (w,  t,  o)  €  Rn+S  x  K 5+1  such  that 
F£8(7)(x)  =  m  (Vx  €  V),  i.e.  /  =  F£,(r).  We  will 
refer  to  r  as  a  s -representation  of  Fg8  for  /. 

Let  VC  Rn  with  cardinality  \V\  =  v  >  1.  A  k- 
valued  function  /  :  V  K  is  s -separable  if  and  only 
if  it  has  a  s-representation  (w,t,o).  In  other  words,  a 
(n,  s)-perceptron  partitions  the  space  V  C  Rn  into 
s  +  1  distinct  classes  •  •  •  i  Bs°m\  using  s  parallel 
hyperplanes,  where  =  {a?  €  ^(/(if)  =  Oj  and 

tj  <  wx  <  tj+i}.  We  assume  that  to  =  -oo  and 
t8+i  =  +oo.  Each  hyperplane  equation  denoted  by  Hj 
(1  <  j  <  s )  is  of  the  form 

Hj  :  wx  =  tj 

Multilinear  separability  (s-separability)  extends  the 
concept  of  linear  separability  (1-separability  of  the 
common  binary  1-threshold  perceptron)  to  the  (n,  k ,  s)- 
perceptron.  Linear  separability  in  two- valued  case  tells 
us  that  a  (n,2,  l)-perceptron  can  only  learn  from  a 
space  V  C  [0,  l]n  in  which  there  is  a  single  hyperplane 
which  separates  it  into  two  disjoint  halfspaces:  = 

{f|/(5)  =  0}  and  h[1]  =  {x\f{x)  =  1}.  From  the 
(n,  2,1) -perceptron  convergence  theorem  [6],  concepts 
which  are  linearly  nonseparable  cannot  be  learned  by 
a  (n,  2,  l)-perceptron.  One  example  of  linearly  nonsep¬ 
arable  two-valued  logic  function  is  the  n-input  parity 
function.  Likewise,  the  (n,  k,  s) -perceptron  convergence 
theorems  [3]  state  that  a  (n,  ft,s)-perceptron  computes 
a  given  function  /  E  P£  if  and  only  if  /  is  s-separable. 
Figure  1  shows  an  example  of  2-separable  4- valued  logic 
function  of  . 

Let  V  C  Rn  with  \V\  =  v  >  2.  A  (n,v,s) -partition 
is  a  partition  of  V  by  s  <  v  -  1  parallel  hyperplanes 
(namely  (n  -  l)-planes)  which  do  not  pass  through  any 
of  the  v  points.  For  instance  Figure  1  also  illustrates  an 
example  of  (2, 52, 2)-partition  of  the  given  set  of  points 
in  Pi. 

A  (n,v,s)-partition  determines  s  +  1  distinct  classes 
So,..., S 8  C  V  separated  by  s  parallel  (n  —  l)-planes 
such  that  UU  ^  =  CLo  &  =  ®  and  Si  7^  0*  A 
(n,  v,  $)-partition  corresponds  to  a  s-separable  A:-valued 
function  /  E  if  and  only  if  all  points  in  the  set  Si, 


2  2  2  2  2 


Figure  1:  A  2-separable  function  of  Pi. 

have  the  same  value  taken  out  of  K.  Also,  we  assume 
that  any  two  neighboring  classes  have  distinct  values. 
Clearly,  the  number  of  associated  functions  is  k(k  - 1)* . 

For  given  s  >  2,  the  capacity  of  a  (n,k,s)- 
perceptron  with  domain  V  is  approximated  by  the 
product  of  the  number  of  (n,v,s)- partitions  and  the 
number  of  functions  associated  with  each  ( n,v,s )- 
partition  of  V.  Thus  counting  the  number  of  (n,  v,  s)- 
partitions  of  V  is  a  first  step  toward  calculating  the 
capacity  of  ( n ,  k,  s)-perceptrons. 

A  linear  partition  of  a  point  set  V  is  a  (n,v,  1)- 
partition.  The  enumeration  problem  for  linear  parti¬ 
tions  is  closely  related  to  the  efficiency  measurement 
problem  for  linear  discriminant  functions  in  pattern 
recognition  and  to  many  other  algorithmic  problems. 
The  capacity  of  (n,  2,  l)-perceptrons  with  domain  V 
[6]  is  well-known  and  is  given  by 

{T  if  n,  >  v  —  1 

2”  -  2  Ei=n+1  ( ”  7 1 )  otherwise 

[5]  estimated  lower  and  upper  bounds  for  the  capac¬ 
ity  of  (n,2,s)-perceptrons,  using  two  essentially  differ¬ 
ent  enumeration  techniques.  The  paper  demonstrated 
that  the  exact  number  of  multiple-threshold  functions 
depends  strongly  on  the  relative  topology  of  the  input 
set.  The  results  corrected  a  previously  published  es¬ 
timate  [7]  and  indicated  that  adding  threshold  levels 
enhances  the  capacity  more  than  adding  variables. 

3  Linear  partitions  and  minimal  pairs 

An  unordered  pair  (x,  y)  of  distinct  points  of  a  finite 
set  V  C  Rn  is  a  minimal  pair  in  V  if  there  does  not 
exist  a  third  point  z  of  V  which  belongs  to  the  open 
line  segment  [x,y\.  A  set  V  C  Rn  is  in  general  position 
if  and  only  if  no  subset  of  V  of  d  +  1  points  lies  on 
a  (d  -  l)-plane  (for  1  <  d  <  n)  and  no  two  (n  -  1)- 
planes  defined  from  V  are  parallel  (for  2  <  d  <  n). 
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The  number  of  linear  partitions  of  a  finite  planar  set, 
no  three  points  of  which  are  collinear,  is  well  known. 
The  number  of  linear  partitions  of  V  in  general  position 
is  a  well  known  result  of  pattern  recognition  [6],  and  is 
given  by 


*»-=£(  "71)  « 

i-0  x  7 

A  relationship  between  linear  partitions  and  mini¬ 
mal  pairs  of  a  finite  planar  set  V  is  given  in  [2],  Namely, 
that  the  number  of  linear  partitions  of  V  is  equal  to 
the  number  of  minimal  pairs  in  V .  Moreover,  each 
minimal  pair  determines  two  linear  partitions  of  V  and 
conversely,  each  linear  partition  of  V  has  exactly  two 
associated  minimal  pairs  [2].  These  facts  provide  an 
easier  way  of  counting  linear  partitions  for  arbitrary 
finite  planar  point  sets:  it  suffices  to  count  minimal 
pairs. 

In  [2]  an  explicit  formula  for  the  number  of  linear 
partitions  of  the  (i,j)-grid  (a  rectangular  part  of  the 
infinite  grid)  is  stated,  using  the  correspondence  with 
minimal  pairs.  The  same  reference  includes  an  efficient 
algorithm  for  counting  linear  partitions  of  the  (i,  j)-grid 
in  linear  time  with  respect  to  the  number  of  its  points. 


4  Counting  multilinear  partitions  of 
subsets  in  general  position 


In  this  section,  V  C  Rn  is  in  general  position 
and  LnyViS  denotes  its  number  of  (n,  v,  s)-partitions. 
Clearly,  every  pair  of  points  in  V  is  a  minimal  pair.  We 
first  obtain  the  number  of  (2,  w,  s)-partitions,  that  is  for 
the  two  dimensional  space,  and  then  we  find  LnyVj8. 

We  rewrite  the  number  of  linear  partitions  of  V  in 
general  position  with  a  little  modification  as 


Tl  /  ^  \ 

=  £n,v— 1,1  H”  1,17—1,1  =  ^  ^  (  i  ) 


The  difference  with  the  original  formula  (1)  is  that 
the  index  i  starts  with  1.  The  reason  is  that  we  do  not 
include  partitions  containing  empty  classes. 

To  count  the  (2,u,s)-partitions,  we  associate  each 
(2,u,s)-partition  with  a  slope  a  6  R  (or  equivalently, 
a  minimal  pair)  as  follows. 


•  For  each  of  the  s  partitioning  lines,  choose  one  of 
the  two  minimal  pairs  which  has  the  smaller  slope; 


•  The  associated  slope  of  the  given  (2,  v,  s)-partition 
is  the  maximum  among  these  smaller  slopes. 

Lemma  4.1  The  number  of  (2,  u,  s)-partitions  associ¬ 
ated  with  a  given  slope  a  is 


Theorem  4.1 


Proof  Since  V  is  in  general  position,  then  there  are 

^  2  ^  slopes.  The  inequality  is  explained  by  the  fact 

that  two  distinct  choices  of  minimal  pairs  (or  slopes)  may 
have  sets  of  associated  (2,  v,  s)-partitions  that  intersect  each 
other.  Therefore  a  given  partition  may  be  counted  many 
times,  depending  on  the  configuration  of  V.  For  instance 
consider  one  of  the  partitions  in  Figure  2.  Clearly,  the  same 
partition  can  be  obtained  either  by  selecting  the  upper  min¬ 
imal  pair  or  by  selecting  the  lower  minimal  pair  (indeed  in 
this  example,  they  both  give  the  same  set  of  associated  par¬ 
titions,  even  though  the  points  are  in  general  position).  We 
have  equality  only  when  s  =  1.  • 


Theorem  4.2 


Proof  The  proof  is  similar  to  that  of  Theorem  4.1. 
We  select  a  linear  partition  of  V  in  Ln,v,i  ways.  For  each 
such  linear  partition,  we  sort  the  points  of  V  along  the 
direction  of  the  separating  (n—  l)-plane,  that  is  according  to 
their  distance  to  that  hyperplane.  To  construct  a  (n,  v,  s)- 
partition  we  must  add  s  —  1  more  parallel  ( n  —  l)-planes. 

There  are  ^  ^  ^  ^  ways  to  place  s  —  1  new  hyperplanes 

in  v  —  2  available  positions.  Here  again  we  have  equality 
only  for  s  =  1.  • 

Corollary  4.1  The  number  of  n-input  k-valued  s - 
separable  functions  f  :  V  K  is 


Figure  2:  Two  (2, 4, 2)-partitions  for  the  same  function. 

For  k  -  2,  Corollary  4.1  gives  a  much  better 
(tighter)  upper  bound  on  the  capacity  of  (n,2,  s)- 
perceptrons  than  the  following  upper  bound  obtained 
in  [5]. 
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However,  in  (2),  the  constant 


also  counts 


partitions  containing  empty  classes.  So,  to  remove 
them,  we  must  replace  it  by 

A  s-separable  function  /  :  V  H*  K  is  permutably 
homogeneous  if  and  only  if  s  <  k  —  1  and,  for  any 
partitioning  of  V,  no  two  separate  classes  have  equal 
value  [3].  These  functions  are  determined  by  (n,i>,$)- 
partitions  in  which  each  of  the  s  +  1  classes  So,  •  •  •  ,  Ss 
maps  to  a  distinct  value  in  K. 


Theorem  4.3  A  permutably  homogeneous  s-separable 
function  f  :V  ^  K  has  a  unique  (n,v,$) -partition. 


Proof  Suppose  /  has  two  distinct  (n,  u,  ^-partitions 
Pi  and  P2.  Consider  two  points  x  and  y  which  are  in  the 
same  class  with  respect  to  Pi  but  in  different  classes  with 
respect  to  P2.  We  have  that  f(x)  —  f(y ),  because  x  and  y 
share  the  same  class  in  Pi.  This  means  that  two  distinct 
classes  in  P2  have  the  same  value.  This  contradicts  the 
definition  of  permutably  homogeneous  function.  • 


Corollary  4.2  The  number  of  permutably  homoge¬ 
neous  n-input  k-valued  s-separable  functions  f  :  V 
K  is 


5  Counting  multilinear  partitions  of 
the  (&,  fc)- grid 


In  this  section  we  generalize  the  counting  method  of 
[2]  to  enumerate  multilinear  partitions  of  the  (k,  A;)-grid 
K 2. 

Let  (x,y)  be  a  pair  of  points  from  a  finite  planar 
grid.  Let  a  =  max(|z2--£i|,  \y2-y1D  and  b  =  minfla^- 
Xi  | ,  1 2/2 -yi  |)«  One  well  known  condition  for  a  pair  (x ,  y) 
to  be  minimal  is  that  a  and  b  are  relatively  prime,  that 
is  their  greatest  common  divisor  is  1.  a  _L  b  will  denote 
that  the  integers  a  and  b  do  not  have  common  divisor. 

Thus  the  number  of  minimal  pairs  corresponds  to 
the  number  of  pairs  ( x,y)  of  the  (i,j)-grid  such  that 
a  X  b.  Let  natural  numbers  i  and  j  be  given  so  that 
i  <  j.  The  generalized  Farey  (i,j) -sequence  Fi,j  [2]  is 
the  strictly  increasing  sequence  of  all  the  fractions  of 
the  form  where  the  integers  a  and  b  satisfy:  a  1  b, 
0  <  b  <  a  <  j,  b  <  i.  Thus  the  sequence  is  as 
follows: 

11112  1  231.43234 
7654735727534  5* 

The  Farey  i-sequence  Fi  for  any  positive  integer  i  is 
the  set  of  irreducible  rational  numbers  with  0  <  b  < 


a  <  i  and  a  _L  6,  arranged  in  increasing  order  [2].  So 
the  sequence  P4  is: 


0  1112  3  1 
1432341* 

The  length  of  the  sequence  F^j  (resp.  Fi)  will  be 
denoted  by  \Fij\  (resp.  |F*|).  Also,  Ffj  (resp.  Ff) 
stands  for  the  d- th  fraction  in  Fij  (resp.  Fi),  1  <  d  < 
\Fij\  (resp.  \Fi\). 

To  count  the  (2,  fc2,  s)-partitions,  we  associate  to 
each  (2,fc2,s)-partition  a  fraction  £  £  Fk- 1  as  before 
Lemma  4.1,  where  v  —  k2. 

Then  to  enumerate  or  generate  the  (2 ,k2,s)- 
partitions  associated  with  a  given  |  we  will  need  Lem¬ 
mas  5.1  and  5.2  below.  As  in  figure  3,  rotate  a  line 
segment  [a?,  y\  whose  slope  £  is  irreducible  to  increase 
it  for  a  small  amount,  so  that  we  obtain  a  straight  line 
P  with  slope  m  £  Fk- 1.  P  is  the  direction  for  sepa¬ 
ration  and  corresponds  to  the  line  segment  [x,  y\  with 
slope  £. 


Figure  3:  Rotating  slope  ~  toward  slope  m. 


Lemma  5.1  Slope  £  of  the  line  segment  [x,y\  is 
greater  than  or  equal  to  the  associated  slope  of  any 
(2,  k2,s) -partition  in  direction  parallel  to  P,  where  P  is 
the  direction  of  separation  that  corresponds  to  the  line 
segment  [x,y\. 

Lemma  5.2  Slope  -  is  equal  to  the  associated  slope 
of  a  (2,  fe2,  s) -partition  in  direction  parallel  to  P  if  and 
only  if  at  least  one  of  the  s  separating  lines  intersects 
a  minimal  pair  with  slope  - . 

[2]  obtained  an  exact  formula  for  the  number  of  lin¬ 
ear  partitions  of  the  (i,j)-grid.  Substituting  for  the 
(fc,  fc)-grid  we  obtain  the  following  corollary. 

Corollary  5.1 


Litkatl  =2fc(fc-l)  +  2(fc-1)2+4  (k  —  a)(k  —  b). 

a-L6,0<6<a<ft 
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Lemma  5.3  I^,*2,,**  = 


to  select  a  minimal  pair  with  slope  0  <  £  <  1  to 


f  k2-  Is 

0  /  Jfe  —  1  > 

\  0  /  2k  -  2  \  , 

r 

(  k2- 1  \ 

(  ak  +  bk  —  ab~  1  \ 

(  4  c 

)-2(  *  , 

>-H  .  j+ 

^  X^aX6,0<6<o<fc  ^ 

*  ) 

-(  .  jj 

4  £  [(‘2;1)-( 

a±b,0<b<a<k  L  v  '  x 


ak  +  bk  —  ab  —  1 
s 


Proof  Let  a  slope  £  be  such  that  a  Lb  (we  consider 
slopes  for  directions  between  0°  and  45°.  For  each  such 
slope  Lemmas  5.1  and  5.2  give  a  simple  algorithm  to 
construct  (i.e  generate)  and  count  the  (2,  fe2,s)-partitions 
associated  with 

•  Rotate  slope  ~  to  increase  it  a  bit  {this  gives  the  di¬ 
rection  of  an  associated  (2,  A;2,  s)-partition  P}; 

•  Sort  the  points  along  this  direction; 

•  Choose  s  points  out  of  k2  -  1  points  in 

(  k2  -1  \ 

I  1  ways;  {selected  points  are  begining  of 

classes  Si, . . . ,  S3  and  the  point  xo  =  (k  —  1, 0)  can 
always  be  selected  for  So} 

•  Eliminate  all  selections  of  points  where  no  minimal 
pair  of  slope  ~  is  intersected  by  a  separating  line  of 
direction  P; 

To  intersect  one  of  the  s  separating  lines,  the  lower  end 
of  a  minimal  pair  with  slope  ~  must  be  selected.  From 
Corollary  5.1  we  have 

•  If  b  =  0  then  there  are  k(k  —  1)  lower  ends 

(the  number  of  minimal  horizontal  segments  of  the 
(Ar,  A:)-grid).  None  of  them  can  be  selected  in 

(  *2  “  1  ~  k('k  ^  k  ~  1  )  ways.  So  the 


number  of  ways  to  select  a  minimal  pair  with  slope 

/  k2  -  1  \ 

Y  to  intersect  a  separating  line  is  2  f  1  — 


•  If  a  =  b  =  1  then  there  are  ( k  —  l)2  lower  ends  (the 
number  of  minimal  segments  with  slope  45°).  None 
_ /fc2-l-(fc-l)2  \  _ 


of  them  can  be  selected 

/  2k -2  \  c  . 

I  ^  I  ways.  So  i 


^  ^  J  ways.  So  the  number  of  ways  to  select 

a  minimal  pair  with  slope  y  to  intersect  a  separating 

iineis2(  *  r1  )_2(  2\~2  )• 

If  a  >  b  >  0  then  there  are  ( k  —  a)  (k  —  b) 
lower  ends  (the  number  of  horizontal  rectangles 
of  size  a  x  6  of  the  (fc,  fc)-grid).  None  of  them 

can  b.  selected  in  (  ft*  - 1 -<*-«>(*- «  \  _ 


ak  4-  bk  —  ab  —  1 
s 


Taking  the  total  sum  of  all  three  cases  yields  the  formula. 
This  completes  the  proof.  • 

Theorem  5.1 

*(v 

^  y!  ( ^  "* ^  j 

a±6,0<6<a<fc  '  ' 

Proof  Follows  from  Lemma  5.3  and  the  fact  that  the 
last  sum  is  over  |Ffe_i,fc-i|.  • 

Lemma  5.4  ak  4-  bk  —  ab  —  1  <  k2  —  1. 

Corollaries  5.2  and  5.3  below  give,  respectively,  a 
lower  bound  and  an  upper  bound  on  the  number  of 
(2,  fc2,  $)-partitions  of  the  (fc,  &)-grid. 

Corollary  5.2  L2,*2,s  > 


Corollary  5.3  £2,fc2,s  < 

V)-(V) 


ways.  So  the  number  of  ways 


The  asymptotic  formula  for  the  length  of  the  gen¬ 
eralized  Farey  (i,j) -sequence  (for  i  <  j)  is  given  in  [2] 

as  I-Fijl  =  +  0(i2j  log  j)  +  0(ij2  log  log  j).  Hence, 

substituting  for  the  (&,  fc)-grid  we  obtain  |jF*_i,*_i|  = 
+  0(k3  log  k)  +  0(k3  log  log  k). 

Theorem  5.2  L2>k2,3  ~ 

Proof  We  take  the  mean  of  the  lower  and  upper  bounds 
and  replace  |Ffc-i,A:-i|  by  its  formula  given  above.  • 

Corollary  5.4  The  number  of  2-input  k-valued  s- 
separable  logic  functions  is  \F^3\  <  k(k  -  l)sL2,fc2>5. 

The  example  given  in  figure  2  explains  the  inequality 
in  Corollary  5.4.  For  instance,  the  XOR  funcion  has 
exactly  two  (2,4,2)-partitions.  For  s  =  1  we  obtain  an 
asymptotic  formula. 

Corollary  5.5  The  number  of  permutably  homoge¬ 
neous  2-input  k-valued  s -separable  logic  functions  is 

\GlJ  W  (fe-s-l)!^2,fc2,«* 
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6  Complexity  of  counting  the  number 
of  (2,  k2,  s)-partitions 

Given  a  point  set  V  in  the  plane,  there  are  ^L-~L  pairs 
of  points.  Generally  speaking,  each  pair  requires  v  -  2 
tests  to  check  whether  it  is  minimal.  Thus  the  com¬ 
plexity  of  the  general  case  for  the  extraction  of  all  min¬ 
imal  pairs  is  0(v3).  However,  if  the  considered  v  =  ij 
points  are  all  the  points  of  the  (i,  j)-grid,  then  this  algo¬ 
rithm  turns  out  to  be  linear  with  respect  to  v,  namely 
0(v),  on  the  basis  of  Corollary  5.1  and  the  fact  that 
the  successor  of  a  member  of  Fi,j  can  be  calculated  in 
a  constant  time  [2].  The  simple  linear  time  algorithm 
for  generating  is  described  in  [2]. 

In  Figure  4  we  show  the  fast  algorithm  for  counting 
(2,fc2,s)-partitions  of  the  (fc,  fc)-grid.  It  is  a  modified 
version  of  the  [2]  algorithm  for  enumerating  the  linear 
partitions  of  the  (i,  j)-grid,  so,  the  reader  is  refeered  to 

[2]  for  a  complete  proof  of  correctness  of  the  algorithm. 
The  modifications  include  replacing  the  (ij)-grid  by 
the  (fc,  fc)- grid  and  computing  the  binomial  coefficients 
present  in  Lemma  5.3. 


ficients  is  O(s).  Therefore,  for  each  generation  of  a 
number  |  €  Fk-itk-u  it  takes  0(s)  to  compute  the 
number  of  (2,  A;2,  s)-partitions  associated  with  -.  Since 
there  are  k 2  elements  in  the  (fc,  fc)-grid,  then  it  takes 
0(sk2)  <  0(k4)  time  to  calculate  Clearly,  the 

time  complexity  is  polynomial  on  k. 

7  Conclusion  and  further  research 

In  this  paper  we  have  introduced  the  concepts  of 
multilinear  separability  and  multilinear  partitions,  and 
derived  formulae  for  the  number  of  multilinear  parti¬ 
tions  of  subsets  in  general  position  and  of  the  (fc,  fc)- 
grids.  A  more  difficult  problem  would  be  to  enumerate 
partitions  for  higher  dimension  grids.  One  interesting 
open  question  is  the  following.  In  how  many  ways  can 
we  partition  a  finite  set  V  C  Rn  using  s  non-necessarily 
parallel  hyperplanes?  The  answer  to  this  question  gives 
(bounds  on)  the  capacity  of  some  classes  of  neural  net¬ 
works. 
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Abstract 

In  this  paper ,  a  B-ternary  logic  based  asynchronous 
pipeline  is  presented .  The  pipeline  processes  binary 
data  elastically .  It  has  high  speed  operation  potential 
in  spite  of  having  an  idle  phase ,  because  the  stages  of 
the  pipeline  operate  concurrently .  The  mechanism  for 
correct  pipeline  behavior  and  the  designed  circuits  are 
provided . 

1.  Introduction 

Pipeline  architecture  is  widely  used  for  high-speed 
computations,  for  instance,  image  processing,  arith¬ 
metic  units  or  instruction  decoding  within  computers. 
Data  in  a  pipeline  system  are  concurrently  processed 
at  successive  stages.  Thus,  for  this  feature,  memory  el¬ 
ements  and  processing  logic  circuits  appear  alternately. 
Originally,  an  asynchronous  pipeline  was  designed  by 
I.  E.  Sutherland  [1],  It  has  been  called  an  event  driven 
micropipeline.  Pipelines  can  of  course  be  implemented 
using  either  clock-controlled  or  Request /Acknowledge 
controlled  circuitry.  Some  forms  of  pipeline  are  inelas¬ 
tic  and  look  like  shift-registers.  In  an  elastic  pipeline, 
not  only  is  the  amount  of  data  variable  but  so  is  the 
input/output  speed.  The  pipeline  behaves  as  a  first- 
in  first-out  queue  (FIFO).  Asynchronous  processing 
schemes  possess  some  significant  advantages  for  such 
a  pipeline.  These  include  simpler  design  results  and 
circuitry  suited  to  implementing  elastic  systems.  In 
contrast,  it  was  shown  that  the  implementation  of  the 
elastic  pipeline  as  a  synchronous  system  is  significantly 
more  complicated  [1]. 

In  this  paper,  a  B-ternary  logic  [2]  based  asyn¬ 
chronous  pipeline  is  considered.  The  pipeline  presented 
processes  binary  data  elastically.  It  has  high  speed  op¬ 
eration  potential,  in  spite  of  having  an  idle  phase,  be¬ 


cause  the  stages  of  the  pipeline  operate  concurrently. 
To  implement  the  design,  we  employ  ternary  AND, 
OR,  NOT  gates  and  C-elements  [5].  The  operations 
of  AND,  OR  and  NOT  are  min,  max  and  complement, 
respectively.  (For  details  of  the  CMOS  implementation 
of  these  elements,  refer  to  [5].)  The  ternary  C-element 
is  a  binary  memory  element  that  accepts  ternary  in¬ 
put  signals.  Figure  1  shows  the  ternary-in,  binary-out, 
Muller  C-element,  which  contains  a  kind  of  inverter 
^'{°)i./2}  ^-{0,1/2}  jg  saj^  to  be  a  down-literal  .  It 

performs  as  XWV  =  (  J  |  1  ^  ^  *  The 

C-element  is  set  to  1  when  its  input  is  1,  it  resets  to  0 
when  the  input  is  0  and  it  captures  the  previous  data 
value  when  the  input  is  1/2.  In  practice,  the  value  1/2 
is  filtered  out  by  means  of  a  threshold  in  the  down- 
literals.  This  also  avoids  metastability  in  the  binary 
SR-FF  of  Figure  1. 

{0,1/2} 


C' 


Figure  1.  Ternary-in  binary-out  C-element. 
upper:  circuit  lower  left:  symbol 
lower  right:  truth  table. 

The  C-element  is  useful  in  data  paths  of  asyn- 
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chronous  systems  to  separate  the  stages  in  a  data 
stream.  We  exploit  a  three  state  coding  which  has 
three  detectable  states:  logical  0,  logical  1  and  a  null 
state  (spacer)  1/2.  In  a  data  stream,  bits  of  data  (0 
or  1)  and  spacers  (1/2)  appear  alternately  to  separate 
successive  data  bits.  This  three  state  coding  is  also 
referred  to  as  2-phase  data  transferring. 

The  advantages  of  the  B-ternary  logic  based  asyn¬ 
chronous  system  are  as  follows. 

1.  It  reduces  the  interconnection  complexity  in  an 
asynchronous  system  (almost  twice  the  connec¬ 
tions  of  the  synchronous  case)  [3]  [4]  [5]. 

2.  The  number  of  transistors  in  ternary  circuits 
(gates)  is  nearly  equal  that  of  2-rail  logic.  Note 
that  the  binary  asynchronous  system  based  on  2- 
rail  logic  is  equivalent  to  the  B-ternary  system 
[5], 

3.  B-ternary  circuits  can  be  directly  connected  to 
binary  ones  without  data  encoding/decoding. 

4.  Logic  circuits  for  the  purpose  of  data  processing 
can  be  designed  using  CAD  tools  intended  for 
binary  synchronous  circuits  (e.g.  espresso). 

5.  The  presented  ternary-in  binary-out  memory 
(Muller  C-element)  avoids  metastability. 

In  the  following  section,  the  basis  of  a  micropipeline 
(simply  pipeline)  and  a  design  of  a  B-ternary  asyn¬ 
chronous  pipeline/FIFO  are  explained. 


of  the  pipeline  at  certain  moments  in  time,  and  the 
#1~#4  are  the  stages  of  the  circuit.  State  transitions 
of  ml~m5  show  data  transfers:  #2— >-#3,  #1— >>#2, 
#4-x>ut,  and  #3->-#4,  respectively.  The  transi¬ 
tion  m5-»m6  indicates  a  concurrent  data  transfer  of 
#2— )-#3  and  #4— K>ut.  Finally,  the  data  in  #3  moves 
to  #4  during  m6“Hn7. 


\ 

(m7)  — * 


Figure  3.  Data  transfer  examples  in  an  elastic 
pipeline. 

To  summarize  the  above  considerations, 


2.  Pipeline 


A  block  diagram  of  the  general  pipeline  is  given  in 
Figure  2.  As  shown,  memory  elements  (Mem.)  ap¬ 
pear  between  the  processing  logic  circuits  (L.C.  :  com¬ 
binational  circuit)  throughout  the  pipeline.  The  con¬ 
troller  connected  to  each  memory  determines  whether 
the  data  is  stored  in  the  memory  or  not  (retain).  Once 
data  is  stored  it  is  processed  by  the  next  L.C.  and 
the  result  will  be  stored  in  the  next  memory  element. 
Therefore,  in  the  case  of  the  asynchronous  pipeline,  the 
memory  controllers  resolve  the  timing  of  data  transfers 
through  the  pipeline. 


"data^)  Mem.  L.C.  S  Mem.  L.C.  S 


n 

r  i 

Ctrl 

- * 

*[ct 

Figure  2.  Generalized  micropipeline. 


The  behavior  of  the  asynchronous  pipeline  is  il¬ 
lustrated  in  Figure  3.  The  ml~m7  are  the  states 


(al)  data  transfer  between  successive  stages  progresses 
sequentially ,  or 

(a2)  data  transfers  in  two  or  more  not  necessarily  ad¬ 
jacent  pairs  of  stages  can  progress  concurrently . 

Thus,  from  Figure  2  and  3,  we  see  that  each  datum  can 
move  to  the  next  place  if  it  is  empty,  and  moreover, 
these  transfers  can  be  done  in  parallel  on  the  above 
conditions  (al)  and  (a2). 

The  circuit  that  consists  of  only  memories  and  con¬ 
trollers,  omitting  L.C.’s,  is  an  elastic  FIFO.  In  the  fol¬ 
lowing,  since  no  generality  will  be  lost,  we  use  a  FIFO 
rather  than  a  general  asynchronous  pipeline. 


3.  Design  of  a  FIFO  and  its  sub-circuits 

In  this  section,  design  of  a  FIFO  and  its  sub¬ 
circuits  are  explained.  Figure  4  illustrates  the  general¬ 
ized  ternary  asynchronous  FIFO.  The  FIFO  consists  of 
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ternary-in  binary-out  Muller  C-elements,  request  (Req) 
circuits,  and  Controllers  (ctrl’s). 


Figure  4.  Generalized  simple  ternary 
asynchronous  FIFO. 


For  the  FIFO  of  Figure  4  to  operate  correctly,  the  fol¬ 
lowing  conditions  are  required: 

(1)  If  both  of  the  following  hold,  data  transmission  can 
be  performed . 

(a)  The  memory  (i.e.  C-element)  of  a  certain  stage 
has  data  and  it  is  stable . 

(b)  The  memory  of  the  next  stage  of  (a)  is  empty. 

(2)  If  either  (a)  or  (b)  (or  both)  does  not  hold ,  then 
the  memory  keeps  the  current  data. 

Note  that  memory  empty  means  that  a  C-element 
has  transmitted  its  current  data  to  the  next  C-element 
and  the  next  C-element  has  stabilized.  Under  these 
conditions,  we  are  able  to  have  two  types  of  behavior: 

(%)  sequential  behavior  of  data  transmission  and  wip¬ 
ing  (with  1/2)  in  the  data  path , 

(ii)  concurrent  behavior  of  data  transmission  and  wip¬ 
ing  (with  1/2)  in  the  data  path . 


Figure  5.  Sequential  (left)  and  concurrent 
(right)  behaviors  of  data  transmission  and 
wiping  with  1/2. 


Each  behavior  can  be  expressed  by  a  signal  transition 
graph  (STG)  as  shown  in  Figure  5,  where  we  use  the 
symbols:  t:  Req  to  transfer  data,  w:  wiping  with  1/2, 
and  s:  C-element  stabilized. 

To  achieve  the  above  conditions  with  B-ternary  logic 
circuits,  we  need  the  following  sub-designs. 

(A)  Ternary-in  binary-out  Muller  C-element  with 
stable-state  detector. 

Under  the  unbounded  delay  model,  it  is  not  suffi¬ 
cient  to  consider  the  signal  on  the  input-line  of  the 
memory  to  know  whether  the  memory  has  stored  the 
signal  value  or  not.  Due  to  gate  delays,  there  are  some 
possibilities  of  progression  to  next  behavior  before  the 
C-element  stabilizes.  Thus  we  combine  the  C-element 
with  a  coincident  circuit  to  detect  a  stable  memory 
state  as  shown  in  Figure  6.  The  circuit  outputs:  0 
when  the  value  in  memory  is  changing;  1  when  there 
is  no  change  or  the  value  has  changed  completely;  1/2 
when  the  input-signal  is  1/2. 


Figure  6.  C-element  with  ternary  coincident 
circuit. 

(B)  Data-gating  circuit  by  Req-signal. 

The  data-gating  circuit  is  located  at  the  output  side 
of  memory.  The  gate  opens  to  transmit  stored  data 
from  the  preceding  C-element  to  the  next,  or  to  hold 
the  data  back  while  transmitting  1/2  instead.  Let  the 
data- bit  be  a ,  the  circuit  can  be  expressed  with  re¬ 
stricted  Req  signal  x  E  {1/2, 1}  as  follows, 


g  =  a  *  xV  ~  x 


(i) 
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where  ”V”  and  indicate  ternary  min,  max  and 
complement  operations,  respectively.  The  circuit  for 


this  equation  is  shown  in  Figure  7.  It  outputs  1/2  when 
Req  signal  x  =  1/2  or  it  outputs  data  a  when  x  =  1. 

*={0. 1} 

(data) 


*={1/2,1} 

(Req) 

Figure  7.  Restricted  data-gating  circuit. 

(C)  Data  empty  detection  circuit. 

To  make  a  memory  empty  and  to  detect  it,  two  types 
of  behavior  were  mentioned  above:  sequential  and  con¬ 
current  data  transfer  (t)  and  wiping  a  data-path  with 
1/2  (w)  on  the  memory’s  stable  state. 


Figure  8.  Circuitry  for  sequential  empty 
behavior. 

Figure  8  shows  conceptual  circuitry  for  sequential 
empty  behavior  such  as  the  STG  in  Figure  5  (left).  In 
the  circuit,  the  controller  detects  the  stable  state  of  the 
memory  (C-element)  and  then  controls  the  sequential 
events  t  and  w. 


Figure  9.  Circuitry  for  concurrent  empty 
behavior. 

In  Figure  9,  for  concurrent  t  and  w  behavior  on  s,  we 
need  one  more  interconnection  between  the  controllers. 
The  design  detail  for  these  behaviors  is  provided  in  the 
next  section. 


4.  B-ternary  logic  based  pipeline 

For  a  B-ternary  micropipeline,  we  designed  two 
types  of  FIFO  as  shown  in  Figures  10  and  11.  These 


two  micropipelines  (FIFOs)  satisfy  the  desired  condi¬ 
tions  (al)  and  (a2)  mentioned  in  section  2.  Namely, 
these  two  micropipeline  can  perform  concurrent  data 
processing.  However,  they  have  different  procedures  to 
’’empty”  each  memory.  Figure  10  shows  two  stages  of 
a  FIFO  with  the  sequential  data  empty  circuit.  The 
operations  of  this  FIFO  is  as  follows. 


1.  When  data  has  reached  a  memory  (C-element) 
and  the  memory  has  stabilized,  this  condition  is 
indicated  by  a  specific  controller  Ctrl. 

2.  If  both  the  stabilized  signal  and  a  memory  empty 

signal  from  the  next  stage  meet,  the  ctrl  opens 
the  gate  by  Req  signal  (t)  such  as  1/2  1,  and 

the  data  in  the  memory  will  move  to  the  next 
memory  element. 

3.  When  both  the  memory-stable  signal  (s)  and  a 
data  transfer  signal  (t”)  meet  at  the  AND  gate, 
it  properly  is  an  ’’empty  signal”. 


In  contrast,  the  FIFO  in  Figure  11  has  concurrent 
data  empty  connections.  It  behaves  as  follows. 

1.  A  memory  generates  its  stable-state  signal. 

2.  •  If  both  the  stable-state  signal  and  empty  signal 
from  the  next  memory  are  indicated,  ctrl  opens 
the  gate  to  send  data. 

•  Concurrently,  the  preceding  ctrl  closes  the  data 
path  to  the  memory,  that  is,  wiping  the  path  by 
1/2. 
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Figure  12.  STG  for  the  pipeline  in  Figure  10. 


Figure  10.  FIFO  (pipeline)  with  sequential  empty  procedure. 


centerline 


Figure  11.  FIFO  (pipeline)  with  concurrent  empty  procedure. 


Figure  13.  STG  for  the  pipeline  in  Figure  11. 


Signal  transition  graphs  (STGs)  of  these  two  kinds 
of  pipeline  are  shown  in  Figure  12  and  13,  respectively. 
Note  that  these  STGs  are  illustrated  for  the  t  -  s  - 
w  (or  e)  loop  in  Figure  10  (or  Figure  11),  and  sig¬ 
nal  transitions  in  parentheses  are  for  the  surrounding 
loops’  behaviors. 

From  these  STGs,  we  can  design  Ctrl’s  (controllers) 
as  shown  in  Figure  14  and  Figure  15.  These  Ctrl’s  are 
almost  the  same  as  those  presented  in  the  literature  [5] . 
In  Figure  14,  the  down-literal  d\  operates  as  X ^  = 

^  .  x  =  0 

‘  -  /ri  -  Refer  to  section  1  for  d2- 

0  :  X  =  1/2  or  1 


={0, 1/2,1} 


Figure  14.  Ctrl  for  the  pipeline  in  Fig.10. 

The  pipeline  in  Figure  1 1  needs  one  extra  interconnec¬ 
tion  between  Ctrl’s  to  achieve  concurrent  behavior,  to 
wipe  the  data-path  and  enable  the  next  data  transfer. 


Figure  15.  Ctrl  for  the  pipeline  in  Fig. 11. 


In  the  ctrl  in  Figure  15,  the  signal  e’  signals  the  empty 
state  to  the  previous  (i.e.  sending)  ctrl.  One  can  see 
that  both  Ctrl’s  are  quite  simple. 
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V  - s2cout  - slcout 

.  s3cout 


time  nS 


Figure  16.  SPICE  simulation  of  the  circuit  in  Figure  11. 


Figure  16  is  the  result  of  a  SPICE  simulation  of  the 
FIFO  with  concurrent  empty  procedure  (Figure  11). 
Note  that  the  three  pulses  in  the  figure  are  at  the  out¬ 
puts  of  the  three  C-elements.  The  figure  shows  a  prop¬ 
agation  of  a  pulse  through  the  three  C-elements. 

5.  Conclusion 

We  considered  the  behaviour  of  an  elastic  pipeline 
exploiting  asynchronous  systems.  To  achieve  the  de¬ 
sired  system  behavior,  we  designed  two  kinds  of  mi¬ 
cropipeline.  Interestingly,  these  different  behavioral 
pipelines  can  be  controlled  in  almost  the  same  manner. 
We  stress  that  ternary  asynchronous  micropipelines 
have  attractive  advantages;  the  system  processes  bi¬ 
nary  data  elastically,  and  they  have  high  speed  opera¬ 
tion  potential  in  spite  of  having  an  idle  phase,  because 
the  stages  of  the  pipeline  operate  concurrently.  Finally, 
we  found  that  the  STG  of  a  micropipeline  is  relatively 
complicated  due  to  the  interaction  of  behavior  between 
adjacent  stages. 
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Abstract 

Multiple-Valued  Logic  (MVL)  functions  are  implemented 
via  Boolean  multiple-wire  arrangements  where  a  careful 
state  assignment  methodology  is  used  to  ensure  efficient 
implementation  regimes .  A  * power  of  N*  module  is 
proposed  for  GF  (23).  The  method  avoids  the  need  to 
factorize  the  polynomial  and  circuits  can  be  realised 
using  a  combination  of  NOT  AND  and  XOR  functions.  In 
addition,  a  novel  transform  over  GF  (22)  is  proposed 
which  shows  promise  when  compared  to  the  Reed-Muller- 
Fourier  transform ,  in  its  capacity  to  produce  zero  co¬ 
efficients.  A  possible  implementation  strategy ,  using  Field 
Programmable  Gate  Arrays  (FPGAs)  is  briefly  discussed. 

1.  Introduction 

State  assignment  techniques  provide  a  way  of 
implementing  multiple-valued  logic  functions  with 
Boolean  variables.  For  example  Menger  [1]  mapped  the 
elements  of  GF(23)  to  the  set  of  Boolean  vectors  with 
three  co-ordinates.  As  each  co-ordinate  is  a  logic  0  or 
logic  1  there  are  eight  possible  states  so  such  a  mapping  is 
one  to  one  and  onto.  PLUS  and  TIMES  modules  were 
designed  with  Boolean  algebra.  The  PLUS  module  used 
three  inputs  to  carry  the  state  assignment  for  the  first 
element  to  be  added  and  another  three  inputs  for  the 
second.  The  three  outputs  carried  the  state  assignment 
belonging  to  their  sum.  The  TIMES  module  worked 


similarly.  With  these  two  modules  any  function  defined 
over  the  elements  of  GF(23)  into  the  same  elements  can 
be  implemented. 

State  assignment  techniques  are  currently  used  for 
Finite  State  Machines  [2],  where  the  nodes  are  embedded 
into  a  Boolean  N-cube.  To  aim  for  low  power 
consumption  nodes  that  frequently  switch  between  each 
other  are  embedded  so  that  the  hamming  distance  between 
their  assignments  is  as  small  as  possible.  In  [2]  the 
authors  report  that  the  process  of  state  embedding  can  be 
time  consuming  since  state  embedding  is  an  NP-complete 
problem.  Thus  they  used  an  optimal  heuristic  in  their 
algorithm.  Our  approach  to  finding  good  assignments  has 
been  heuristic  rather  than  exhaustive  because  of  the  large 
number  of  possibilities.  For  instance  there  are  40320 
possible  state  assignments  for  GF(23). 

Existing  transform  techniques  for  implementing 
multiple-valued  logic  have  draw  backs.  In  [3]  a  hybrid 
transform  is  proposed  that  can  represent  a  quaternary 
function  by  a  polynomial  where  powers  are  raised  within 
GF(22)  but  standard  algebra  with  real  coefficients  is  then 
used  for  the  rest  of  the  computation.  The  authors 
proposed  a  universal  logic  module  but  regarded  it  as  too 
computationally  expensive  for  implementation.  In  [4]  and 
[5]  the  Reed-Muller-Fourier  transform  is  introduced 
which  uses  the  4EXP  and  4AND  functions  to  generate  the 
basis  for  the  transform.  Then  mod4  arithmetic  is  used 
with  the  coefficients.  The  advantages  of  RMF  are  that  the 
number  of  coefficients  needed  could  be  smaller  than  in 


0-7695-0161-3/99  $10.00  ©  1999  IEEE 


220 


Galois  field  representation,  they  are  also  easily  calculated 
[5],  Their  disadvantage  is  lack  of  circuit  implementation 
of  the  multiplication  and  addition  operations  [3].  Current 
Mode  CMOS  circuits  have  been  used  in  [6]  to  implement 
the  field  operations  over  GF(22)  and  GF(24)  The  circuits 
for  GF(22)  uses  four  possible  current  levels  on  a  single 
wire.  Their  GF(24)  circuits  required  two  wires.  For  large 
fields  using  a  bundle  of  wires  to  carry  the  representation 
may  be  more  practical  than  packing  a  lot  of  separate 
signal  levels  onto  the  same  wire.  Thus  state  assignment 
techniques  will  always  come  under  consideration. 

In  section  two  of  this  paper  it  is  demonstrated  that  for 
any  transform  if  the  matrix  representation  of  the  basis  is 
invertible  over  a  suitable  integer  ring  then  we  can  use 
integer  coefficients  from  that  ring  and  ring  arithmetic  for 
the  computation.  This  leads  to  a  hybrid  transform  over 
GF(22)  similar  to  that  of  [3]  except  that  mod4  arithmetic 
is  used  instead  of  real  arithmetic.  The  efficiency  of  this 
new  hybrid,  in  terms  of  zero  coefficients,  is  comparable  to 
theRMF. 

This  paper  offers  a  way  of  implementing  multiple¬ 
valued  logic  using  state  assignments  that  permit  the  use  of 
simpler  circuit  designs.  Thus  tackles  the  problems 
mentioned  above.  An  assignment  for  GF(23)  leads  to  the 
design  of  a  new  power  of  ‘N’  module.  This  eliminates  the 
need  for  repeated  multiplication  through  cascaded 
modules  just  for  raising  powers.  An  assignment  for 
GF(22)  is  proposed  that  uses  just  one  AND  gate  to 
generate  the  transform  matrix.  Then  the  inverse  is  taken 
using  mod4  arithmetic.  Which  is  useful  because  a  mod4 
adder  is  just  two  binary  stages  ignoring  the  carry  from  the 
second  stage. 

2.  Transforms  using  integer  rings 

In  order  for  a  square  matrix  to  be  invertible  in  an 
integer  ring  it  is  necessary  and  sufficient  that  the 
determinant  of  the  matrix  be  a  unit  in  the  ring  [7].  For  an 
integer  ring  ZJnZ  the  units  are  those  integers  that  are 
relatively  prime  for  n  [8].  For  example  the  powers  of  a 
variable  x  in  GF(22)  has  the  following  matrix, 

Table  1.  Powers  of  x  in  GF(22) 

1  X  X2  X3 

10  0  0 

1111 

12  3  1 

13  2  1 


As  this  matrix  has  determinant  equal  to  one  mod4  a  mod4 
inverse  exists  which  is  as  follows, 

Table  2.  Inverse  mod4  Hybrid  Transform 

10  0  0 

0  112 

0  12  1 

3  3  11 

The  existence  of  this  hybrid  means  that  functions  of 
GF(22)  into  GF(22)  can  be  realised  by  a  hybrid 
polynomial  where  powers  are  raised  in  GF(22)  algebra  but 
the  rest  of  the  calculation  uses  mod4  arithmetic. 

As  an  illustration  f(x)  has  vector  representation 
(2,3,1,0)t  if  we  multiply  this  vector  by  the  matrix  of  Table 
2  we  obtain  the  coefficients  (2,0,1,0)T'  This  means  that 
f(x)  =  2  +  x2  (mod  4). 

All  256  single  variable  functions  over  GF(22)  were 
examined  and  the  least  number  of  coefficients  needed  for 
each  function  was  calculated,  results  being  optimised 
from  the  4  polarities  available  in  GF(4).  An  expected 
frequency  of  2.301  coefficients  was  obtained,  using  the 
results  from  [11]  the  expected  frequencies  for  RMF  is 
2.316  and  the  GF  method  had  2.332. 

3.  An  assignment  of  binary  triples  to  the 
elements  of  GF(23) 

The  motivation  behind  this  assignment  was  to 
examine  the  properties  of  Boolean  algebra  and  GF(23) 
algebra  and  aim  to  use  the  assignment  to  make  operating 
in  GF(23)  as  similar  to  working  in  Boolean  algebra  as 
possible. 

The  zero  element  eo  of  GF(23)  has  the  property  that, 
eo+ep  ei  and  eo*ei=  eo,  also,  using  +  as  the  co-ordinate 
wise  exclusive-or  operation  (0,  0,  0)  +(xl9  x2,  x3)  =  (xi,  x2, 
x3),  likewise  (0,  0,  0)  AND  (xi,  x2,  x3)  =  (0,  0,  0).  So  we 
let  eo  be  assigned  to  (0,  0,  0).  As  eo*ei=  ei  and  (1,  1,  1) 
AND  (xb  x2,  x3)— (xj,  x2,  x3)  we  assign  ei  to  (1, 1, 1). 

In  Boolean  triples  let  X  =  (xb  x2,  x3),  then  the 
complement  of  X  is  (xi,  x2,  x3)+(l,  1,  1).  In  GF(23)  e3+e2 
=  ei,  e5  +  e4  =  eb  e7  +  e6  =  eb  and  e7  +  e6  =  e^  So  if  we 
arrange  that  e3  is  assigned  to  (1, 0, 0),  e5  to  (0, 0,  1)  and  e7 
to  (0,  1,  0).  Then  to  preserve  this  property  with  exclusive- 
OR  we  must  also  assign  e2to  (0,  1,  1),  e4  to  (1,  1,  0)  and 
e6  to  (1,  0,  1).  This  completes  the  assignment.  (See  Table 

3) 

With  this  assignment  the  Boolean  equivalent  of 
multiplication  becomes : 
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(x1,x2,x3)*(yi,y2,y3)  = 
y3xi+(yi+y2)x3+(y2+y3)x2. 
yiX2+(y2+y3)xi+(yi+y3)x3, 
yiX2+(yi+y2)xi+(yi+y3)x2 

The  logic  circuits  for  multiplication  require  six  input 
wires  and  three  output  wires.  The  logic  needed  for  each 
output  was  worked  out  separately  using  a  Karnaugh  maps. 
For  each  fixed  set  of  values  of  the  Y  vector  the 
multiplication  becomes  a  function  of  X  only.  These 
functions  were  simplified  in  a  three  variable  Karnaugh 
map.  Then  each  function  of  X  was  multiplied  by  the 
mintem  that  represented  the  associated  Y  value.  Finally 
all  eight  results  were  added  using  the  exclusive-OR 
operation  and  further  simplified. 

If  a  particular  polynomial  is  to  be  realized  by 
Boolean  modules  the  X  input  (xi,x2,x3)  can  be  the 
variable  and  the  Y  input  can  represent  the  constant  by 
which  it  is  multiplied.  If  we  allow  the  Y  input  to  vary 
over  it’s  range  we  obtain  Table  6.  Inspecting  the  table  we 
see  that  all  the  co-ordinates  are  exclusive-OR  of  the 
various  X  terms.  This  leads  to  a  simple  constant  times  X 
module  which  consists  of  just  two  exclusive-OR  gates 
and  a  strap. 

Also  when  realizing  Galois  polynomials  it  would  be 
useful  to  have  at  hand  the  X  to  the  power  of  N  function. 
The  derivation  of  each  co-ordinate  raised  to  the  power  of 
N  is  given  in  Table  7. 

Inspecting  the  table  reveals  that  the  power  of  N 
module  can  be  realized  by  three  copies  of  the  function 
ffex^Xfc)  =  Xi+Xj  +(xjxk) 

For  a  practical  example  we  consider  a  function  first 
used  by  Menger  [1]  which  is, 

G(X)  =  e5  +  e6X  +  e3x6.  This  factorizes  to, 

G(X) 

=  [e3(X+e1)(X+e5)(X+e4)2+e4] 

(X+ei)(X+e5). 

Which  can  be  realized  with  four  plus  modules  and 
five  times  modules.  Menger  used  the  assignment  given  in 
Table  4.  From  which  he  designed  a  times  module  using  9 
AND  gates  and  8  exclusive-OR  gates.  The  plus  module 
consists  of  only  three  exclusive-or  gates.  Which  means 
that  his  realization  uses  a  total  of  52  exclusive-OR  gates 
and  45  AND  gates.  A  times  module  for  the  assignment  of 
Table  3  would  use  9  AND  gates  and  9  excIusive-OR 
gates. 


Realising  the  function  directly  with  the  power  of  N 
modules  yields  the  network  of  Fig  1  which  uses  10 
exclusive-OR  gates,  3  AND  gates  and  4  NOT  gates.  The 
technique  used  means  that  the  polynomial  did  not  have  to 
be  factorized  and  the  use  of  a  times  module  just  for 
multiplying  by  constants  was  no  longer  needed.  Indeed 
we  never  used  a  times  module  at  all  since  we  never 
multiplied  by  a  completely  unknown  variable. 

Designing  a  constant  times  X  module  for  Menger’s 
assignment  would  need  3  exclusive-OR  gates  compared 
to  our  two.  Menger’s  assignment  also  leads  to  a  more 
complicated  X  to  the  power  of  N  function  which  does  not 
have  the  standard  form  of 

f(xi,Xj,Xk)=  Xi+Xj+tXjXk). 

For  example  with  Menger’s  assignment, 

X3  =  (X!  +  X2X3,  X!  +  X2  +X3  +  XjX3, 

X!X2+X3+X2X3) 

4.  A  method  of  implementing  two  variable 
functions  over  GF(22) 

In  this  section  we  develop  a  method  of  implementing 
two  variable  functions  over  GF(22)  with  two  wires,  each 
wire  either  carries  a  logic  1  or  a  logic  0.  We  use  the 
natural  binary  assignment  e0  to  (0,0),  e!  to  (0,1),  e2  to 
(1,0)  and  e3  to  (1,1).  Using  the  arithmetic  basis  of  [9] 
leads  to  the  following  matrix  representation  of  the  basis: 

1  Xi  X2  XiX2 

10  0  0 

L=  1100 
10  10 
1111 

This  matrix  has  an  inverse  over  the  ring  of  integers 
mod  4  which  is: 

10  0  0 

3  10  0 

3  0  10 

13  3  1 

So  in  line  with  the  ideas  presented  in  section  one  a 
hybrid  transform  exists  where  any  quaternary  function 
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can  be  realized  as  a  linear  combination  of  these  functions 
with  coefficients  and  calculations  taken  mod4. 

For  two  variables  we  can  take  the  Kronecker  product 
of  L  with  itself  and  use  four  wires.  The  inverse  for  this 
will  be  the  Kronecker  square  of  the  inverse  of  L.  As  an 
example  we  will  again  implement  the  two  variable 
function  given  in  [3]  by  Falkowski  and  Rahardja.  In  Table 
5  we  rewrite  the  function  with  two  variables  Z\  and  z2 
where  Xi,  x2  are  the  wires  assigned  to  Z\\  x3,x4  to  z2.  Our 
method  leads  to  the  following  implementation: 

F(  zu  z2 )  =  F(xi,  x2#  x3,  X4)  =  3xi+  x2+  xix2+  x3 
+3X!X3  +2X!X2X3+  2X4+  X1X4+  XiX2X4+  2x3x4 
+  2x2x3x4+  2xtx2x3x4 

where  for  example  2x2x3X4  is  worked  out  by  the  logical 
AND  of  x2  and  x3  and  x4  then  the  result  which  is  either  a  1 
or  a  0  is  multiplied  by  2.  All  additions  are  carried  out  mod 

4,  where  a  mod  4  adder  is  simply  a  two  stage  binary 
adder  where  we  can  ignore  the  carry  digit  from  the  second 
stage. 

5.  A  new  set  of  basis  functions  for  GF(22) 

Considering  the  matrix  L  of  the  previous  section,  as 
this  matrix  and  its  inverse  has  a  triangular  structure,  we 
can  consider  an  extended  set  of  24  polarities  obtained  by 
permutating  the  rows  of  L  in  a  similar  manner  to  that 
suggested  for  the  RMF  transform  in  [11].  A  comparison 
of  results  for  the  RMF  and  L  across  all  256  one  variable 
functions  of  GF(22)  is  contained  in  the  table  below.  Each 
set  being  optimised  over  24  polarities.  The  number  of 


functions  realizable 

;  with  t  products  is  shown  by: 

t 

RMF 

L'1 

0 

1 

1 

1 

57 

33 

2 

138 

138 

3 

60 

84 

4 

0 

0 

2.004 

2.191  (Expected  Frequencies) 

Our  use  of  the  basis  represented  by  L  is  a  special 
case  of  logic  developed  in  [10].  Where  it  was  shown  that 
for  n  variables  any  set  of  2n  functions  that  are  linearly 
independent,  when  added  using  exclusive-or,  will  form  a 


basis.  Thus  a  transform  in  terms  of  these  functions  exists. 
As  the  only  constraints  on  our  choice  of  functions  is  linear 
independence  we  are  free  to  change  this  basis  into  many 
alternative  forms. 

Considering  the  matrix  L,  as  each  X*  is  Boolean,  we 
can  rewrite  this  transform  in  any  one  of  four  polarities. 
The  last  column  being  an  AND  function.  We  went 
further  and  choose  as  last  columns  all  the  Boolean 
functions  that  were  linearly  independent  of  the  first  three 
columns.  These  turned  out  to  be  the  functions  with  either 
one  logic  1  or  three  logic  l’s  and  easy  to  implement.  This 
gave  a  total  of  32  possibilities.  These  transform  functions 
can  be  used  in  any  situation  whether  a  state  assignment 
technique  is  used  or  not.  A  software  simulation  was  used 
to  implement  all  256  single  variable  functions.  We  were 
able  to  choose  the  polarity  and  logic  function  to  minimise 
the  number  of  coefficients  these  coefficients  had  an 
expected  frequency  of  2.051. 

6.  Conclusion 

Field-Programmable  Gate  Arrays  offer  a  promising 
structure  for  building  the  circuits  suggested  in  this  paper. 
Over  GF(23)  the  homogeneous  nature  of  the  power  of  ‘N’ 
module  in  each  coordinate  means  that  the  complete  design 
is  just  the  duplication  of  smaller  units.  The  strapping 
arrangement  for  the  constant  times  X  module  could  be 
done  in  switches.  In  [  12  ]  Vranesic  points  to  a  possible 
future  where  the  data  running  through  a  FPGA  will  be 
multi-valued  but  the  control  bits  would  remain  digital. 
Using  state  assignments  may  bring  us  half  way  to  this 
future.  Using  our  GF(23)  assignment  a  Universal  Logic 
Module  can  be  built  that  can  implement  any  function. 
Inspecting  Table  7  again  we  see  that  only  three  power  of 
‘N’  modules  are  actually  needed.  Other  powers  are  got 
by  permutation  of  inputs  or  simple  logic.  If  constant 
times  X  modules  are  used  to  multiply  by  coefficients  then 
we  can  consider  the  strapping  arrangements  as  the  control 
bits  which  generate  any  given  function.  Vranesic  in  [  12  ] 
raised  the  coming  possibility  of  implementing  4-valued 
Look  Up  Tables.  This  means  we  could  see  non-Boolean 
state  assignments  that  can  easily  handle  larger  fields.  For 
example  GF(26)  on  three  wires.  Logic  modules  for  4EXP 
and  4AND  can  be  designed  using  state  assignment 
techniques.  The  4EXP  is  similar  to  a  X  to  the  power  of  N 
module.  Work  still  needs  to  be  done  to  decide  which 
assignment  is  optimal  also  to  achieve  RMF  design  in 
Larger  fields.  On  the  negative  side  state  assignment 
techniques  bring  with  them  the  disadvantages  of  extra 
wiring.  At  what  stage  this  will  cease  to  be  economic  as 
assignments  are  applied  to  larger  fields  remains  to  be 
found  out. 
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Table  5.  A  two  variable  function  over  GF(22) 


Z1 

z2 

F(Z!,Z2) 

x4 

x3 

x2  X1 

0 

0 

0  0 

0 

0 

0 

0  1 

3 

0 

0 

1  0 

1 

0 

0 

1  1 

1 

0 

1 

0  0 

1 

0 

1 

0  1 

3 

0 

1 

1  0 

2 

0 

1 

1  1 

3 

1 

0 

0  0 

2 

1 

0 

0  1 

2 

1 

0 

1  0 

3 

1 

0 

1  1 

2 

1 

1 

0  0 

1 

1 

1 

0  1 

0 

1 

1 

1  0 

0 

1 

1 

1  1 

2 

Table  6.  Constant  times  X 

Y  Times  X  Co-ordinates 

Y 

First 

Second 

Third 

eo 

0 

0 

0 

C1 

X1 

X2 

X3 

C2 

VX3 

X3 

Xl+X2+X3 

C3 

X3 

X2+X3 

Vx2 

*4 

X2 

Xl+X2+X3 

VX3 

C5 

Xl+X2 

VX3 

X2 

C6 

X  +  X+X„  X  +x. 

12  3  12 

X1 

C7 

x  +x 
!  2  3 

X1 

Xl+X3 
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Table  3.  An  assignment  for  GF(23) 


Table  4.  Menger’s  assignment 


ELEMENT 

ASSIGNMENT 

ELEMENT 

ASSIGNMENT 

X 

X 

X 

X 

X 

Xl 

Xl 

x2 

1 

2 

3 

e 

eo 

0 

0 

0 

0 

0 

0 

0 

e 

C1 

0 

1 

0 

1 

1 

1 

1 

e 

2 

0 

0 

1 

C2 

0 

1 

1 

e 

3 

0 

1 

1 

C3 

1 

0 

0 

e 

1 

0 

0 

4 

C4 

1 

1 

0 

C5 

1 

1 

0 

e5 

0 

0 

1 

C6 

1 

0 

1 

C6 

1 

0 

1 

e7 

1 

1 

1 

C7 

0 

1 

0 

Table  7.  X“ 

Assignment  Co-ordinates 


First 

Second 

Third 

X 

X1 

X2 

X3 

x2 

X2 

X3 

X1 

x3 

X3+Xi+(XiX2) 

Xi+X2+(X2X3) 

X2+x3+(X3X!) 

x4 

X3 

X1 

X2 

x5 

X2+X3+(X3X!) 

X3+Xi+(XiX2) 

X1+x2+(X2X3) 

x6 

Xi+X2+(X2X3) 

X2+x3+(X3Xi) 

X3+Xl+(XiX2) 

x7 

x  OR  x2  OR  x3 

X1  OR  x^  OR  x^ 

x10Rx20Rx3 

Figure  1.  A  network  for  G(X)=  es+  e$+  e3X<5 
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Abstract 

In  this  paper ,  the  theory  of  information  relationships  and 
relationship  measures  is  considered  and  its  application  to 
logic  design  is  discussed.  This  theory  makes  operational 
the  famous  theory  of  partitions  and  set  systems  of 
Hartmanis.  The  information  relationships  and  measures 
enable  us  to  analyze  relationships  between  the  modeled 
information  streams  and  constitute  an  important  analysis 
apparatus  that  can  be  used  for  analysis  and  synthesis  of 
various  information  systems .  It  can  be  applied  in  logic 
design ,  pattern  analysis,  machine  learning,  and  other 
fields.  This  paper  shows  how  to  apply  the  relationships 
and  measures  to  logic  design  when  using  as  examples 
three  important  problems:  input  support  minimization, 
parallel  decomposition  and  serial  functional 
decomposition.  The  application  examples  and 
experimental  results  presented  in  the  paper  demonstrate 
the  high  potential  of  the  information  relationships  and 
measures  in  solving  logic  synthesis  problems. 

1.  Introduction 

The  opportunities  created  by  modem  microelectronic 
technology  cannot  be  fully  exploited,  because  of 
weaknesses  of  the  traditional  logic  synthesis  methods. 
Particularly  in  the  case  of  (C)PLDs,  look-up  table  FPGAs 
and  complex  CMOS  gates,  the  constraints  are  imposed 
not  on  the  function  type  a  certain  block  can  implement, 
but  on  various  building  block  structural  parameters  (e.g. 
the  number  of  inputs,  outputs,  product  terms  in  a  building 
block  or  the  number  of  serial  and  parallel  transistors  in  a 
gate)  and  on  interconnections  between  the  building 
blocks.  A  block  is  able  to  implement  any  function  with 
limited  dimensions.  On  the  other  hand,  the  traditional 
logic  synthesis  methods  do  not  consider  hard  structural 
constraints.  In  principle,  they  are  devoted  to  only  some 
very  special  cases  of  possible  implementation  structures 
involving  some  minimal  functionally  complete  systems  of 
logic  gates  (e.g.  AND+OR+NOT,  AND+EXOR,  MUX). 
They  require  a  post  synthesis  technology  mapping  for 


another  implementation  structures.  If  the  actual  synthesis 
target  strongly  differs  from  these  minimal  systems,  e.g. 
involves  a  lot  of  complex  gates,  look-up  table  FPGAs  or 
(C)PLDs,  any  technology  mapping  cannot  guarantee  a 
good  result  if  the  initial  synthesis  was  performed  without 
close  relation  to  the  actual  target. 

Therefore,  there  is  presently  much  research  in  the  field  of 
general  (functional)  decomposition  of  combinational 
circuits  and  sequential  machines  [3]~[10]  [12]  [14]  [15] 
[17]  [18]  [20]-[22].  The  most  promising  recent 

approaches  in  pattern  analysis,  knowledge  discovery, 
machine  learning,  decision  systems,  data  bases,  data 
mining  etc.  are  also  based  on  general  functional 
decomposition  [11]  [12]  [13]  [16]  [19]  [23].  In  [5],  I 
presented  the  theory  of  general  decomposition  and 
explained  how  the  theory  can  be  used  for  the  synthesis  of 
sequential  and  combinational  circuits.  For  large  circuits 
however,  the  number  of  possible  decompositions  is  so 
great  that  it  becomes  necessary  to  construct  only  the  most 
promising  decompositions,  using  the  general 
decomposition  theorems  presented  in  [5]  together  with  the 
appropriate  heuristic  evaluation  functions  and  selection 
mechanisms.  These  evaluation  and  selection  mechanisms 
must  limit  the  search  space  to  a  manageable  size  while 
keeping  high-quality  solutions  in  the  limited  space.  In 
particular,  the  analysis  and  evaluation  of  relationships 
between  information  in  various  information  streams  of  a 
considered  system  is  here  of  primary  importance.  In  [8]  I 
proposed  the  fundamental  analysis  apparatus  for  this  aim: 
the  theory  of  information  relationships  and  measures. 

The  information  relationships  and  measures  serve  for 
analysis  and  estimation  of  information  and  information 
interrelationships  in  discrete  systems,  as  modeled  by  any 
sort  of  discrete  relations,  functions  and  sequential 
machines.  They  can  be  applied  for  both  the  binary  as  well 
as  the  multiple-valued  and  symbolic  systems,  in  many 
fields  of  modem  engineering  and  science,  including  logic 
and  architecture  synthesis  for  VLSI  systems  [3]  [10]  [12] 
[14]  [15]  [17]  [18]  [20]-[22],  pattern  analysis,  knowledge 
discovery,  machine  learning,  decision  systems,  data  bases, 
etc.  [11][12][13][16][19][23].  Results  of  the  relationship 
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analysis  make  it  possible  to  discover  the  nature  of  the 
considered  system  or  to  decide  its  structure  in  order  to 
satisfy  given  design  constraints  or  optimize  certain 
objectives. 

The  theory  of  information  relationships  and  measures 
makes  operational  the  famous  theory  of  partitions  and  set 
systems  of  Hartmanis  [2].  Partitions  and  set  systems 
enable  us  to  model  information  streams.  The  information 
relationships  and  measures  enable  us  to  analyze  the 
relationships  between  the  modeled  information  streams. 

2.  Representation  of  information  in  discrete 
information  systems 

Information  is  represented  in  discrete  systems  by 
values  of  some  signals  or  variables.  Let’s  consider  a 
certain  finite  set  of  elements  S  called  symbols. 
Information  about  symbols  (elements  of  S)  means  the 
ability  to  distinguish  certain  symbols  from  some  other 
symbols.  Knowing  a  certain  value  of  a  certain  attribute, 
signal  or  variable  jc,  it  is  possible  to  distinguish  a  certain 
subset  B  of  elements  from  £  from  all  other  elements  of  S , 
but  it  is  impossible  to  distinguish  between  the  elements 
from  B.  For  example,  if  we  consider  an  incompletely 
specified  multiple-output  Boolean  function  in  Fig.  1, 
where  symbols  1,...,5  represent  terms  (cubes)  on  input 
variables  Xt,...,  x*,  then  different  values  of  the  input 
variable  x3  enable  us  to  distinguish  between  the  subset  (1, 
2,  3}  and  {4,  5}.  For  1,  2  and  3:  x3  =  0  and  for  4  and  5:  x3 
=  1.  Thus,  knowing  that  x3  =  0,  we  know  that  1,  2  or  3 
occurred,  but  we  do  not  know  which  of  the  three  symbols 
occurred.  Knowing  that  x3  =  1,  we  know  that  4  or  5 
occurred,  but  we  do  not  know  which  of  the  two  symbols 
occurred.  In  such  a  way  information  is  modeled  with 
partitions  and  set  systems  [1][2][5]. 

A  set  system  SS  on  a  set  S  is  defined  as  a  collection  of 
nonempty  and  distinct  subsets  Bh  B2,  Bk  of  S  called 
blocks,  such  that: 

(J.  B{  =  S  and  B{  Bj  for  i  *  j 

A  generalized  set  system  (blanket  [23])  SS  on  a  set  S 
is  defined  as  a  collection  of  nonempty  and  distinct  subsets 
BhB2>  ...,5*ofS  such  that:  |Jf£f  =  £. 

The  set  system  algebra  is  presented  in  [1].  The 
product  and  sum  operations  are  defined  for  set  systems 
as  follows: 

SSj  •  SS2 =  Max  (  {BSSi  o  Bss2 1  Bssi  €  SSi  a 
Bss2  €  SS2}  )j  and  SSj  SS2 =  !S4ax  ( SS^  ^  SS2)j 
where:  Max(  {B*  |  BjcS  } )  =  {B  |  Be{Bi}  a  ((B’e{Bi}  a 
BcB’)  =>  B=B’)  }  (Max  removes  blocks  contained  in 
other  blocks). 

SS!  is  smaller  than  or  equal  to  SS2:  SSj  <  SS2  if  and 
only  if  each  block  of  SS]  is  included  in  a  block  of  SS2. 

A  set  system  SS  on  S  can  be  interpreted  as  a 
compatibility  relation  defined  on  S,  with  the  compatibility 
classes  being  the  blocks  of  SS.  Such  a  compatibility 
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Figure  1.  Incompletely  specified  multi¬ 
output  Boolean  function 

relation  is  reflexive  and  symmetric  but  is  not  required  to 
be  transitive.  If  it  is  transitive,  i.e.  the  subsets  Bj  of  the 
corresponding  set  system  SS  are  disjoined,  then  it  is  an 
equivalence  relation  and  the  set  system  SS  is  a  partition. 
A  certain  set  system  SS  (partition  P)  gives  information 
about  the  elements  of  S  with  limited  precision  to  the 
compatibility  (equivalence)  class.  With  this  information  it 
is  possible  to  distinguish  the  different  classes  although  it 
is  impossible  to  distinguish  between  the  elements  of  the 
same  class.  The  set  system  product  can  be  interpreted  as  a 
product  of  the  corresponding  relations;  it  represents 
combined  information  about  the  elements  of  S  that  is 
provided  by  the  relations  together.  The  set  system  sum 
can  be  interpreted  as  the  sum  of  the  corresponding 
relations  and  it  represents  information  about  elements  of  S 
after  applying  the  combined  abstraction  provided  by  the 
relations  involved.  The  partial  ordering  relation  denotes 
the  fact  that  if  SSj  <  SS2,  then  SSi  provides  the  same  or 
more  information  about  elements  of  S  than  SS2.  In  this 
paper,  we  will  use  the  term  set  system  to  designate  a  set 
system,  generalized  set  system  or  partition,  everywhere 
where  this  will  not  lead  to  misunderstanding. 

When  analyzing  or  designing  information  systems,  we 
are  interested  in  relationships  between  information 
streams  which  express  such  properties  as  similarity 
(common  information),  dissimilarity  (different 
information),  missing  information,  extra  information,  etc. 
If  we  want  to  consider  the  interesting  relationships  and  to 
base  the  design  decisions  on  results  of  such  analysis, 
introduction  of  an  adequate  analysis  apparatus  is 
necessary.  This  is  however  impossible  before  answering 
the  question:  “what  information  is  modeled  by  a 
certain  set  system?”  To  answer  this  question,  it  was 
necessary  to  discover  what  is  an  elementary  (atomic) 
portion  of  information,  i.e.  such  portion  that  any  other 
information  (for  example  as  modeled  by  a  set  system)  can 
be  expressed  as  a  composition  of  such  atomic  portions. 

I  proposed  the  following  solution  [8]:  an  elementary 
information  describes  the  ability  to  distinguish  a  certain 
single  symbol  s,  from  another  single  symbol  Sj,  where:  Sj, 
Sj  eS  and  Sj  *Sj.  Any  set  of  atomic  portions  of  information 
can  be  represented  by  an  information  relation  I, 
information  set  IS  and  information  graph  IG  defined 
on  SxS  as  follows: 

I  =  {(sj,  Sj)  |  Sj  is  distinguished  from  Sj  by  the  modeled 
information}, 
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IS  =  {{Sj,  Sj}  |  Sj  is  distinguished  from  sj  by  the 
modeled  information},  and 

IG  =  {S,  {{Si,  Sj}  |  Sj  is  distinguished  from  Sj  by  the 
modeled  information} }. 

In  a  strictly  analogous  way,  the  notion  of  the 
elementary  (atomic)  abstraction,  and  the  appropriate 
abstraction  (compatibility)  relation  A,  set  AS  and  graph 
AG  can  be  introduced  [8]. 

Relationships  between  set  systems  can  now  be 
analyzed  by  considering  the  relationships  between  their 
corresponding  information  and  abstraction  relations,  sets 
and  graphs.  In  particular,  the  correspondence  between  SS, 
IS  and  AS  is  as  follows:  IS  contains  the  pairs  of  symbols 
that  are  not  contained  in  any  single  block  of  a 
corresponding  SS,  and  AS  contains  the  pairs  of  symbols 
that  are  contained  in  at  least  one  single  block  of  a 
corresponding  SS. 

Example  1,  (information  modeling  with  set  systems 
and  information  sets) 

For  the  function  from  Fig.  1  the  appropriate  set  systems 
and  information  sets  for  particular  input  and  output 
variables  and  for  the  whole  4-output  function  are  listed 
below: 

SS(x,)=  { { 1 ,2,5 } ; {3,4,5 } }  IS(x,)={l|3, 1|4,  2|3, 2|4}, 

SS(x2)=  { { 1 ,2,3,5 } ;  { 1 ,3,4} }  IS(x2)={1|4;  2|4;  4|5}, 

SS(x3)=  { { 1 ,2,3 } ; {4,5 } }  IS(x3)={1|4,1|5,2|4,2|5,3|4,3|5}, 
SS(X4)={{1,3,5};{2,4,5}}  IS(x4)={1|2,  1|4,  2|3, 3|4}, 

SS(x5)={  { 1,2,5}; {3,4} }  IS(x5)={ 1 13, 1 14,2|3,2|4,3 15,4|5 } , 
SS(x6)={{1,2,3,4};{2,3,4,5}}  IS(x6)={1|5}. 


SS(yi)=  { { 1 ,4,5 };  {2,3,5 } } 
SS(y2)=  { { 1 ,2,3,4 } ;  {4,5 } } 
SS(y3)=  { { 1,2,3,5 } ;  (2,4,5 } } 
SS(y4)=  { { 1 ,2,3,5 } ;  {4} } 


IS(y,)={l|2, 1|3,  2|4, 3|4}, 
IS(y2)={l|5;  2|5;  3|5}, 
IS(y3)=  { 1  K»3 14} , 
IS(y4)={l|4,2|4,  3|4, 4(5} . 


SS(yi,y2,y3,y4)={  { 1 } ;  {2,3} ;  {4} ;  {5} } 
IS(y„y2,y3,y4)={l|2,l|3,lK,l|5,2|4,2|5,3|4,4|5,3|5,4|5}l 

To  stress  that  the  symbols  from  a  pair  of  incompatible 
symbols  are  distinguished  from  each  other  and  to  shorten 
the  notation  of  information  sets,  we  will  further  denote 
each  pair  of  incompatible  symbols  {si,  Sj}  by  Sj|Sj. 

In  the  same  way  information  can  be  modeled  with  set 
systems  and  information  sets  for  any  discrete  system, 
including  multiple-valued  systems.  For  instance  in  the 
case  of  a  multiple-valued  function,  the  only  difference 
compared  to  the  binary  function  is  the  fact,  that  the  set 
systems  induced  by  particular  input  and  output  variables 
are  the  multiple-block  set  systems  instead  of  two-block 
set  systems.  Each  of  them  has  as  many  blocks  as  many 
values  can  take  a  certain  input/output  variable. 


3.  Information  relationships  and  measures 


When  performing  analysis  or  design  of  information 
systems,  we  often  ask  for  relationships  between 


information  in  various  information  streams,  places  or 
parts  of  a  considered  system.  In  general,  we  are  interested 
in  information  relationships  that  express  similarity, 
dissimilarity,  missing  information,  extra  information  etc. 
Below,  some  basic  information  relationships  are  recalled 
which  express  these  elementary  properties: 

•  common  information  Cl  (i.e.  information  that  is 
present  in  both  SS!  and  SS2):  CI(SS,,  SS2)  =  IS(SS0  n 
IS(SS2) 

•  total  (combined)  information  TI  (i.e.  information 
that  is  present  either  in  SS[  or  in  SS2):  TI(SSi,  SS2)  = 
IS(SS,)  uIS(SS2) 

•  missing  information  MI  (i.e.  information  that  is 
present  in  SSi,  but  missing  in  SS2):  MI(SSi,  SS2)  = 
IS(SS!)-IS(SS2) 

•  extra  information  El  (i.e.  information  that  is  missing 
in  SSi,  but  present  in  SS2):  EI(SSi,  SS2)  =  IS(SS2)  - 
IS(SS,) 

•  different  information  DI  (i.e.  information  that  is 
present  in  one  of  the  set  systems  and  missing  in  the 
other):  DI(SS!,  SS2)  =  MI(SSh  SS2)  u  EI(SS,,  SS2). 

Analogous  relationships  can  be  defined  for  abstraction 

[8], 

Example  2.  (some  information  relationships  for  the 
function  from  Fig.  1) 

CI(SS(yi),  SS(X4))  =  {1|2,3|4} 

TI(SS(y,),  SS(X4))  =  { 1 12,  1|3,  1|4,  2|3,  2|4,  3|4} 

MI(SS(y,),  SS(X4))={1|3,2|4} 

EI(SS(y,),  SS(X4))={1|4,2|3} 

DI(SS(yi),  SS(X4»  =  { 1|3,  1|4,  2)3, 2|4}  I 

With  the  relationship  apparatus  defined  above,  we  can 
answer  such  questions  as:  what  information  required  to 
compute  values  of  a  certain  variable  is  present  in  another 
variable,  what  information  is  missing,  what  extra 
information  is  present  etc. 

However,  knowledge  of  "what"  is  often  not  sufficient 
to  take  appropriate  design  decisions,  because  the  design 
decisions  are  based  in  most  cases  not  only  on  some 
qualitative,  but  also  on  some  quantitative  comparisons. 
Along  the  knowledge  of  "what",  some  knowledge  of 
"how  much"  and  “how  important”  is  required.  For 
instance:  how  much  of  the  required  information  provides 
a  certain  variable  and  how  much  another  one,  is  it  much 
more,  etc.  Moreover,  information  that  is  provided  by  just 
a  single  input  may  be  considered  as  more  important  than 
information  provided  by  ail  inputs,  an  input  that  delivers  a 
unique  information  may  be  considered  as  more  important 
than  another  input  that  delivers  only  information  provided 
also  by  some  other  inputs  etc. 

Therefore,  some  quantitative  relationship  measures  are 
often  necessary. 

For  a  single  set  system  SS  the  information  quantity 
IQ  is  defined:  IQ(SS)  =  |IS(SS)|. 

For  two  set  systems  SSI  and  SS2,  the  following 
relationship  measures  are  defined: 
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•  information  similarity  (affinity)  measure  ISIM: 

ISIMCSS,,  SS2)  =  |CI(SSi,  SS2)| 

•  information  dissimilarity  (difference)  measure 

IDIS:  IDIS(SSh  SS2)  =  |DI(SS,,  SS2)| 

•  information  decrease  (loss)  measure  IDEC: 

DDEC(SSls  SS2)  =  |MI(SSh  SS2)| 

•  information  increase  (growth)  measure  IINC: 

IINC(SS!,  SS2)  =  |EI(SS!,  SS2)| 

•  total  information  quantity  TIQ: 

TlQ(SSuSSi)  =  \TI(SSu  SS2)|. 

It  is  also  possible  to  define  some  relative  measures,  by 
normalizing  the  above  absolute  measures,  and  weighted 
measure,  for  example,  by  associating  an  appropriate 
importance  weight  w(Si|Sj)  with  each  elementary 
information  [8].  In  a  strictly  analogous  way,  the 
abstraction  relationship  measures  can  be  defined  [8]. 

4.  Application  of  the  relationships  and 
measures  to  logic  synthesis 

The  information  and  abstraction  relationships  as  well 
as  the  measures  for  the  strength  and  importance  of  the 
relationships  enable  designers  and  tools  to  analyze 
relationships  between  the  information  streams  of  the 
considered  systems  and  provide  them  with  data  necessary 
for  effective  and  efficient  decision-making.  Results  of  the 
relationship  analysis  make  it  possible  to  discover  the 
nature  of  the  considered  system  or  to  decide  its  structure 
in  order  to  satisfy  given  design  constraints  or  optimize 
certain  objectives. 

In  order  to  demonstrate  the  importance  of  the 
information  relationship  analysis  and  to  show  how  to 
apply  it  in  the  logic  synthesis  process,  let’s  .consider  the 
problems  of  the  input  support  minimization,  parallel 
(output)  decomposition  and  serial  functional 
decomposition.  These  problems  are  highly  relevant  and 
common  for  both  the  binary  logic  applications  (e.g.  VLSI 
circuit  design  [3]-[10][12][14][15][17]tl8][20]-[22])  and 
multiple-valued  logic  applications  (e.g.  pattern  analysis, 
knowledge  discovery,  machine  learning,  decision 
systems,  data  bases,  VLSI  circuit  design  etc. 
[11][12][I3][16][19][23]).  They  are  considered  below  for 
the  case  of  incompletely  specified  Boolean  functions. 
However  application  of  the  information  relationships  and 
measures  to  the  multiple-valued  logic  is  strictly 
analogous,  because  in  the  multiple-valued  case  the 
information  is  modeled  with  information  sets  the  same 
way  as  in  the  binary  case,  and  the  relationships  and 
measures  operate  on  the  information  sets. 

4.1.  Application  to  input  support  minimization. 

Input  support  minimization  consists  of  finding  a 
minimal  sub-set  of  inputs  that  contains  all  information 
that  was  required  to  compute  by  the  originally  specified 
function,  or  sequential  machine  [7]  [9]. 


Let’s  find  the  minimum  support  for  the  function  from 
Fig.  1,  by  analyzing  the  relationships  between  the 
information  required  for  computing  the  output  values 
(modeled  by  set  system  SS(y1,y2,y3,y4)= 

{ { 1 } ; (2,3 };{4};{5}})  and  information  provided  by 
particular  input  variables.  Some  of  the  interesting 
relationships  and  the  corresponding  measures  are  given 
below: 

CI(y,x1)={l|3,l|4,2|4}  |CI(y,  x,)|  =  3 

CI(y,x2)={l|4,2|4,4|5}  |CI(y,x2)|  =  3 

CI(y,  x3)  =  {1|4, 1|5,  2|4, 2|5, 3|4,  3|5}  |CI(y,  x3)|  =  6 

CI(y,  X4)  -  {1|2,  1|4,  3|4}  |CI(y,X4)|  =  3 

CI(y,  x5)  =  {1|3,  1|4,  2|4, 3|5, 4|5}  |CI(y,  x5)|  =  5 

CI(y,x6)={l|5}  |CI(y,  x<s)|  =  1 

Observe  that: 

|CI(y,  x3)|  >  |CI(y,  x5)|  >  |CI(y,  x,)|  =  |CI(y,  x2)|  = 

|CI(y,  X4)|  >  |CI(y,  Xg)|. 

The  information  similarity  measure  tells  us,  that  x3 
delivers  more  information  that  is  required  for  computing  y 
than  x5,  x5  more  than  Xi,  etc.  Furthermore,  x3  delivers  a 
unique  information  that  is  necessary  for  computing  the 
output,  but  is  not  delivered  by  any  other  input  variable, 
namely:  2|5.  Therefore,  x3  must  be  in  any  support,  thus 
also  in  any  minimal  support.  Since  CI(y,  x3)  does  not 
cover  IS(y)  completely,  the  symbols  from  the  pairs  given 
by  IS(y)  -  CI(y,  x3)  =  {1|2,  1|3,  4|5}  are  not  distinguished 
from  each  other  by  x3,  and  therefore  at  least  one  extra 
input  variable  is  necessary  for  the  minimum  support. 
Because  CI((IS(y)  -  CI(y,  x3)),  x,)  =  0,  CI((IS(y)  -  CI(y, 
x3)),  x2)  =  {4|5},  CI((IS(y)  -  CI(y,  x3)),  X4)  =  {1|2}, 
CI((IS(y)  -  CI(y,  x3)),  x5)  =  {1|3,  4|5},  CI(y,  X6)  =  0,  no 
single  variable  delivers  the  lacking  information,  but  X4 
and  x5  together  provide  it.  This  means  that  x3,  X4  and  x5 
constitute  the  minimum  input  support. 

In  a  similar  way  the  information  relationships  and 
weighted  measures  are  used  in  our  genetic  algorithm 
(GA)  for  finding  the  minimal  input  support  [7].  In 
particular,  the  weighted  information  similarity  measure  is 
used  in  GA  for  the  semi-random  construction  of  the  initial 
support  population.  The  probability  of  a  certain  input 
variable  to  be  selected  for  an  initial  support  is 
proportional  to  the  value  of  this  measure.  The  same 
measure  is  also  used  for  the  selection  of  input  variables  in 
the  local  search  performed  by  the  deterministic  merge  and 
repair  operators  of  the  genetic  algorithm  [7].  Table  1 
presents  the  test  results  of  GA  compared  to  the  strictly 
minimal  results  and  results  from  QuickScan  algorithm 
[7] [9]  computed  at  the  HP9000/735  processor  under  HP- 
UX  9.05.  The  results  from  GA  are  computed  quickly  and 
are  in  almost  all  cases  strictly  optimal  and  sometimes  on 
distance  one  from  optimum.  The  test  files  were  specially 
generated,  because  the  standard  logic  synthesis 
benchmark  set  contained  functions  with  minimal  input 
supports.  The  benchmarks  do  not  contain  either  essential 
or  dominated  inputs  and  are  “difficult”,  because  they 
cannot  be  solved  or  reduced  by  the  preprocessing 
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Table  1.  Test  results  of  the  genetic  algorithm 
(GA)  and  QuickScan  (QS). 
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20 
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48 

ilOO 
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11 

30 

10.83 

3:31 

9 

1:02 

9 

9:58 

Q-quality  (number  of  variables  in  the  support)  of  the  best  support  found, 
Q(min)  -  quality  of  the  minimum  support,  T  -  computation  time 
(min:sec),  QS  -  QuickScan,  GA  -  genetic  algorithm,  GA-  -  GA  without 
merge,  mg  -  merge  performed  on  the  full  original  support, 1  Average 
over  6  runs, 2  Average  over  8  runs. 

techniques  (i.e.  recursively  removing  the  rows  covered  by 
essential  columns,  and  the  dominated  columns  and  rows). 
The  number  by  “i”  in  the  benchmark’s  name  is  equal  to 
the  number  of  input  variables  of  the  benchmark.  More 
complete  characteristics  of  the  benchmarks  and  of  the 
way  they  were  generated  as  well  as  benchmark  results  of 
another  input  support  minimization  programs  can  be 
found  in  [7]  and  [9]. 

4.2.  Application  to  parallel  (output) 
decomposition. 

Parallel  decomposition  is  a  very  important 
decomposition  technique  used  for  multiple  output 
functions,  relations  and  sequential  machines 
[4]  [6]  [12]  [17].  In  parallel  decomposition,  computations  of 
values  for  certain  subsets  of  outputs  are  performed  in 
separate  blocks.  Thus,  parallel  decomposition  consists  of 
partitioning  of  the  set  of  output  variables  in  disjoint 
subsets,  each  satisfying  the  building  block  constraints,  so 
that  some  objectives  are  optimized  and  some  other 
possible  constraints  satisfied. 

Assume  that  each  logic  building  block  for 
implementing  the  considered  example  function  can 
implement  any  function,  with  maximally  2  inputs  and  2 
outputs  (as  in  look-up  table  FPGAs).  Since  the  example 
function  has  4  outputs  and  requires  minimum  3  inputs,  it 
is  impossible  to  implement  the  function  with  a  single 
building  block.  The  function  must  be  decomposed  into  at 
least  two  blocks.  Find  a  parallel  decomposition  with 


minimal  number  of  blocks  and  minimum 
interconnections. 

Let’s  compute  the  affinity  relationships  between  the 
information  required  by  particular  output  variables  and 
information  delivered  by  particular  input  variables  and  the 
corresponding  relative  similarity  measures.  For  simplicity 
sake,  let’s  consider  only  the  minimal  input  support  {x3, 
X4,  x5}  (the  function  with  the  original  support  can  be 
considered  precisely  the  same  way). 

CI(yi,  x3)  =  (2|4,  3|4}  ISIMr(yh  x3)  =  0,5 

CI(yb  X4)  =  {1|2,  3|4}  ISEMr(yb  X4)  =  0,5 

CI(yb  x5)  =  (1|3, 2|4}  ISIMr(yb  x5)  =  0,5 

CI(y2,  x3)  =  {1|5, 215,  3|5}  =  IS(y2)  ISIM^y,,  x3)  =  1 

CI(y2,  X4)=  0  ISIMr  (y2,  X4)  =  0 

CI(y2,  x5)=  0  ISIMr(y2,  x5)  =  0 

CI(y3,  x3)  =  {1|4,  3|4}  =  IS(y3)  ISIMr(y3,  x3)  =  1 

CI(y3,  X4)  =  {1|4,  3|4}  =  IS(y3)  ISIMr(y3,  X4)  =  1 

CI(y3,  x5)  =  {1(4} 

CI(y4,x3)={l|4,2|4,  3|4} 

CI(y4,  X4)  =  {1|4,  3|4} 

CI(y4>  x5)  =  {1|4,  2|4,  4|5} 

Since  ISIMr(y2,  x3)  =  1  and  ISIMr(y3,  x3)  =  1,  x3 
delivers  complete  information  required  for  computing  y2 
and  y3.  Thus,  y2  and  y3  can  be  implemented  with  just  a 
single  logic  block  with  one  input  x3  and  two  outputs  y2 
and  y3.  yi  and  y4  require  each  at  least  two  input  variables 
for  their  computation,  because  no  single  variable  delivers 
all  required  information  (ISIMr(yb  x{)<1  and  ISIMr(y4, 
Xj)<l,  for  all  i=3,4,5).  Furthermore,  x5  provides  a  unique 
information  for  yi  (1|3),  and  for  y4  (4|5),  not  provided  by 
any  other  input.  Importance  of  x5  is  thus  highest,  and 
therefore,  x5  must  be  used  both  for  computing  yj  and  y4. 
Similarly,  X4  provides  a  unique  information  for  yt  (1|2), 
importance  of  x4  is  also  highest  and  x4  must  be  used  for 
computing  yb  Since  CI(yb  {x4,  x5})  =  {1|2,  1|3,  2|4,  3|4} 
=  IS(yi),  {x4,  x5}  is  a  complete  minimum  support  for  yb 
{x4,  x5}  is  also  a  complete  minimal  support  for  y4. 


ISIMr(y3,x5)  =  0,5 
ISIMr(y4,  x3)  =  0,75 
ISIMr(y4,  X4)  =  0,5 
ISIMr(y4,  x5)  =  0,75 


because* 

CI(y4,  {X4,  x5})  =  { 1|4,  2|4,  3|4, 4|5}  =  IS(y4). 

Thus,  yi  and  y4  can  be  implemented  with  just  a  single 
building  block  with  two  inputs  x4  and  x5,  and  two  outputs 
yi  and  y4.  In  this  way,  we  constructed  a  parallel 
decomposition  of  the  considered  function  (Fig.  2)  that 
satisfies  the  input  and  output  constraints  of  the  used 
building  blocks,  uses  minimum  number  of  building 
blocks  (just  two  blocks)  and  minimum  interconnections 
(no  common  inputs  to  different  blocks). 

More  information  on  parallel  decomposition,  prototype 
tools  we  developed  for  this  aim  and  some  benchmark 
results  of  these  tools  can  be  found  in  [4]  and  [6]. 


4.3.  Application  to  serial  functional 
decomposition. 

Fig.  3  shows  the  schematic  representation  of  the  serial 
functional  decomposition. 
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Figure  2.  Optimal  parallel  decomposition  of  the 
example  function  from  Fig.  1. 


X  =  AuB 
A 


B 


Figure  3.  Serial  decomposition  scheme 

Let  X  be  the  set  of  input  variables  of  a  given  function 
F.  Let  A  and  B  be  two  subsets  of  X  such  that  Avj  B  =X. 
Let  P(,4)  be  a  set  system  induced  by  the  input  variables 
from  the  set  A  on  the  set  of  the  function  terms  (cubes)  F, 
let  P(2?)  be  a  set  system  induced  on  F  by  the  variables 
from  B  and  PF  be  a  set  system  induced  on  F  by  the  output 
variables  of  F. 

Theorem  1  If  there  exists  a  set  system  nG  on  F  such 
that  P(£)  <  nG,  and  P(^4)  •  nG  <  PF,  then  F  has  a  serial 
decomposition  with  respect  to  (A>B)  [17].  I 

The  conditions  for  serial  decomposition,  can  be  re¬ 
expressed  in  terms  of  the  information  sets  in  the  following 
way: 

Theorem  2.  If  there  exists  a  set  system  nG  on  F  such 
that  IS(P(B))  3  IS(IIG),  and  IS(P(A)  •  nG)  3  IS(PF),  then 
F  has  a  serial  decomposition  with  respect  to  (A,  B)  [20].  I 

In  general,  information  that  is  necessary  for  computing 
values  of  a  certain  output  of  a  considered  multiple-output 
function  is  distributed  on  a  number  of  its  inputs.  The 
inputs  also  deliver  some  information,  that  is  not  needed 
for  the  output,  and  information  on  the  inputs  is 
represented  otherwise  than  on  the  considered  output. 
Block  G  can  be  therefore  considered  as  a  block  where  an 
intermediate  information  transformation  is  performed 
which  involves  an  appropriate  combination,  abstraction 
and  re-structuring  of  the  input  information.  This 
transformation  consists  of  construction  of  an  appropriate 
set  system  nG  from  a  selected  set  system  P(B).  The  set 
system  nG  should  carry  a  part  of  the  information 
delivered  by  the  set  system  P(5),  which  in  combination 


with  information  delivered  by  the  set  system  P(4),  is 
essential  to  compute  the  required  output  information 
(Theorem  2).  Since  the  number  of  block’s  G  outputs 
should  be  in  practical  cases  much  smaller  than  the  number 
of  its  inputs,  nG  must  have  much  less  blocks  than  P(2?). 
nG  is  created  by  merging  the  blocks  of  ?(B)  and  a  lot  of 
abstraction  is  made  during  this  merging.  To  avoid  too  big 
loss  of  information  during  the  merging  process  or  creation 
of  set  systems  nG  with  too  large  number  of  blocks  (which 
would  require  too  many  physical  logic  blocks  in 
implementation),  set  B  should  contain  input  variables  that 
carry  relatively  much  information  which  is  not  important 
for  computing  values  of  the  output  variables.  Moreover,  a 
part  of  the  important  information  delivered  by  variables 
from  set  B  is  in  most  cases  also  delivered  by  some 
transfer  this  information  to  the  output  of  G.  It  can  be 
abstracted  during  the  merging  process.  Also,  the  set 
systems  induced  by  each  of  the  input  variables  from  set  B 
should  be  similar  each  to  another,  because  this  results  in 
less  blocks  of  P (B)  itself  and  less  merging  to  perform. 

Based  on  these  observations  the  following  rules  of 
input  variable  selection  for  set  B  have  been  developed. 
Variable  Xj  should  be  included  into  set  B  if: 

IINC(Pf,  P(Xj))  is  relatively  high, 

ISIM(P(A),  P(xj))  is  relatively  high, 

ISIM(P(B’),  P(xO)  is  relatively  high, 

where:  Xi  -  candidate  to  set  of  indirect  variables  B,  B’  - 
partially  created  set  B. 

Using  these  rules,  we  developed  and  implemented  a 
heuristic  method  for  the  (near-)  optimal  input  support 
selection  in  serial  decomposition.  Experimental  results 
from  our  method  are  presented  below  and  compared  to 
the  results  from  the  systematic  search.  In  the  systematic 
search  the  supports  are  implicitly  enumerated.  For  each 
support,  the  corresponding  set  system  nG  is  computed  and 
the  first  found  support  with  the  minimum  number  of 
blocks  in  nG  is  used  for  decomposition.  The  number  of 
blocks  of  partition  nG  shows  strong  positive  correlation 
with  the  number  of  logic  blocks  and  logic  levels  in  the 
resulting  decomposition,  and  therefore,  it  is  a  good 
measure  for  the  support  quality.  In  the  case  of  our 
heuristic  selection  also  the  first  support  is  used  for 
decomposition,  but  the  first  one  found  based  on 
information  relationships  measures.  We  used  for 
experiments  a  number  of  functions  from  the  international 
logic  synthesis  benchmark  set.  Table  2  and  3  report 
results  of  the  input  support  selection  for  set  B  in  a  single 
serial  decomposition  step,  as  illustrated  in  Fig.  2.  Table  4 
shows  the  results  of  decomposition  of  some  benchmark 
functions  into  a  network  of  4-input,  1 -output  logic  cells, 
obtained  by  repeating  the  serial  decomposition  step  from 
Fig.  3  a  number  of  times.  In  Table  2  the  comparison  of  the 
number  of  blocks  of  partition  IIG  is  presented  for  heuristic 
and  systematic  method. 
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Table  2.  Comparison  of  the  number  of  blocks  in 
partition  nQ  obtained  by  the  systematic  and 
heuristic  method. 
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The  systematic  search  always  finds  a  support,  which 
results  in  partition  nG  with  the  strictly  minimum  number 
of  blocks.  However,  the  method  based  on  information 
measures  produces  also  the  minimal  or  near-minimal 
results  and  does  it  much  faster.  Table  3  shows  comparison 
of  the  computation  time  for  both  methods  for  a  single 
support  selection  step.  In  Table  4,  the  number  of  logic 
cells  (4-input,  1-output  FPGA  cells)  in  decompositions  is 
presented  for  several  benchmarks,  for  decompositions 
found  by  application  of  the  heuristic  or  systematic  support 
selection  a  number  of  times.  The  method  based  on 
information  relationship  measures  produces 
decompositions  with  the  same  or  almost  the  same  number 
of  blocks  as  the  systematic  method.  In  some  cases  the 
results  from  the  heuristic  support  selection  are  even  better 
than  from  the  systematic.  More  information  on  the 
application  of  the  information  relationships  and  measures 
to  the  serial  functional  decomposition  is  presented  in  [18]. 

5.  Conclusion. 

The  famous  theory  of  partitions  and  set  systems  of 
Hartmanis  [2]  and  the  theory  of  information  relationships 
and  measures  discussed  here  form  just  together  an 
information  modeling  and  analysis  apparatus  that  is  of 
primary  importance  for  analysis  and  synthesis  of  discrete 
information  systems,  and  in  particular  for  logic  design. 

We  applied  the  apparatus  of  information  relationships 
and  measures  for  design  decision  making  in  a  number  of 
prototype  CAD  tools  for  solving  various  logic  design 
problems.  Among  others,  we  applied  this  apparatus  for 
solving  some  logic  decomposition  problems  and  finding 
the  minimal  input  support.  The  experimental  results 
obtained  with  our  prototype  tools  are  very  encouraging. 
For  example,  our  input  support  minimization  tool  is  able 


Table  3.  Comparison  of  the  computation  time  (in 
seconds)  for  the  systematic  and  heuristic 
method. 
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Table  4.  Comparison  of  the  number  of  logic  cells 
in  decomposition. 
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to  compute  the  minimal  supports  for  difficult  and  large 
(100  inputs)  problem  instances  within  few  minutes.  Our 
prototype  multi-level  decomposition  tool  for  FPGAs  is 
able  to  perform  a  single  step  of  the  functional 
decomposition  more  than  50  times  faster  than  the  tool 
based  on  systematic  search,  on  benchmarks  with  14-16 
inputs,  10-20  outputs  and  100-300  cubes,  when  producing 
the  strictly  optimal  or  near  optimal  results.  By  repetitively 
applying  the  decomposition  step,  our  tool  is  able  to 
completely  decompose  such  a  benchmark  into  4-input 
CLBs  thousands  times  faster  then  the  systematic 
algorithm,  when  producing  the  minimum  number  of 
CLBs  or  at  most  1  CLB  more  than  the  minimum  on  some 
of  the  checked  benchmarks.  The  difference  in  processing 
time  between  these  two  tools  grows  very  fast  with  the  size 
of  the  decomposed  function. 

The  application  examples  and  experimental  results 
from  our  prototype  tools  demonstrate  the  high  potential  of 
the  information  relationships  and  measures  in  solving 
various  logic  analysis  and  synthesis  problems 
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Abstract 

We  introduce  probabilistic  many-valued  logic  programs 
in  which  the  implication  connective  is  interpreted  as  mate¬ 
rial  implication.  We  show  that  probabilistic  many-valued 
logic  programming  is  computationally  more  complex  than 
classical  logic  programming.  More  precisely,  some  de¬ 
duction  problems  that  are  P-complete  for  classical  logic 
programs  are  shown  to  be  co-NP -complete  for  probabilis¬ 
tic  many-valued  logic  programs.  We  then  focus  on  many¬ 
valued  logic  programming  in  Pr*  as  an  approximation  of 
probabilistic  many-valued  logic  programming.  Surpris¬ 
ingly,  many-valued  logic  programs  in  Pr*  have  both  a  prob¬ 
abilistic  semantics  in  probabilities  over  a  set  of  possible 
worlds  and  a  truth-functional  semantics  in  the  finite-valued 
Lukasiewicz  logics  Ln.  Moreover,  many-valued  logic  pro¬ 
gramming  in  Pr*  has  a  model  and fixpoint  characterization, 
a  proof  theory,  and  computational  properties  that  are  very 
similar  to  those  of  classical  logic  programming. 

1.  Introduction 

We  start  by  presenting  probabilistic  many-valued  logic 
programs  in  which  the  implication  connective  is  inter¬ 
preted  as  material  implication.  We  show  that  probabilistic 
many-valued  logic  programming  in  this  framework  is  com¬ 
putationally  more  complex  than  classical  logic  program¬ 
ming.  More  precisely,  some  deduction  problems  that  are 
P-complete  for  classical  logic  programs  are  shown  to  be  co- 
NP-complete  for  probabilistic  many-valued  logic  programs 
(see  also  [15]  and  [14]  for  other  work  on  the  subtleties  and 
the  computational  complexity  of  probabilistic  deduction). 

We  then  focus  on  many-valued  logic  programming  in 
Pr*  as  an  approximation  of  probabilistic  many-valued  logic 
programming.  Crucially,  many-valued  logic  programming 
in  Pr*  has  a  model  and  fixpoint  characterization  and  a 
proof  theory  that  are  very  similar  to  those  of  classical  logic 
programming.  Furthermore,  special  cases  of  many-valued 
logic  programming  in  Pr*  have  the  same  computational 
complexity  like  their  classical  counterparts. 

Surprisingly  (and  at  first  sight  even  paradoxically). 


many-valued  logic  programs  in  Pr*  have  both  a  proba¬ 
bilistic  semantics  in  probabilities  over  a  set  of  possible 
worlds  and  a  truth-functional  semantics  in  the  finite- valued 
Lukasiewicz  logics  Ln.  That  is,  many- valued  logic  pro¬ 
gramming  in  Pr*  lies  in  the  intersection  between  probabilis¬ 
tic  logics  and  truth-functional  many-valued  logics. 

The  literature  already  contains  quite  extensive  work  on 
probabilistic  and  on  truth-functional  many- valued  logic  pro¬ 
gramming  separately.  However,  to  the  best  of  our  knowl¬ 
edge,  an  integration  of  both  has  never  been  studied  so  far. 

Probabilistic  propositional  logics  and  their  various  di¬ 
alects  are  thoroughly  studied  in  the  literature  (see,  for  ex¬ 
ample,  [18]  and  [5]).  Their  extensions  to  probabilistic  first- 
order  logics  can  be  classified  into  first-order  logics  in  which 
probabilities  are  defined  over  a  set  of  possible  worlds  and 
those  in  which  probabilities  are  given  over  the  domain  (see, 
for  example,  [8]  and  [2]).  The  first  ones  are  suitable  for 
representing  degrees  of  belief,  while  the  latter  are  appropri¬ 
ate  for  describing  statistical  knowledge.  The  same  classi¬ 
fication  holds  for  probabilistic  logic  programming  (see,  for 
example,  [17],  [14],  and  [16]). 

Many  approaches  to  truth-functional  finite- valued  logic 
programming  are  restricted  to  three  or  four  truth  values  (see, 
for  example,  [10],  [6],  and  [4]).  Among  these  approaches, 
the  one  closest  in  spirit  to  many- valued  logic  programming 
in  Pr*  is  perhaps  the  three- valued  one  in  [10]. 

Many- valued  logic  programming  in  Pr*  is  closely  re¬ 
lated  to  van  Emden’s  infinite- valued  quantitative  deduction 
[19].  More  precisely,  it  is  an  approximation  of  probabilis¬ 
tic  logic  programming  under  the  material  implication,  while 
van  Emden’s  quantitative  deduction  can  be  understood  as 
an  approximation  of  probabilistic  logic  programming  under 
the  conditional  probability  implication  [14]. 

Moreover,  many-valued  logic  programming  in  Pr*  is  re¬ 
lated  to  the  work  on  generalized  annotated  logic  program¬ 
ming  [9]  and  to  signed  formula  logic  programming  [11]. 

Many- valued  logic  programming  in  Pr*  itself  was  initi¬ 
ated  in  [12],  where  we  already  presented  a  model  and  fix- 
point  characterization. 

The  rest  of  this  paper  is  organized  as  follows.  Section  2 
deals  with  probabilistic  many-valued  logic  programming. 
In  Section  3,  we  concentrate  on  many-valued  logic  pro- 
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gramming  in  Pr* .  Section  4  summarizes  the  main  results. 

This  paper  is  an  extract  from  a  longer  version  [13],  which 
includes  in  full  detail  all  the  proofs  missing  here. 

2.  Many-valued  logic  programming  in  Prn 

2.1.  Technical  preliminaries 

We  briefly  summarize  how  classical  first-order  logics  can 
be  given  a  probabilistic  n-valued  semantics  with  n  >  3 
in  which  probabilities  are  defined  over  a  set  of  possible 
worlds.  We  basically  follow  the  important  work  of  Halpem 
[8],  which  we  adapt  and  restrict  to  the  n-valued  setting. 

Let  TV  =  {0,  . . . ,  1}  with  n  >  3  denote  the 

set  of  truth  values.  Let  $  be  a  first-order  vocabulary  that 
contains  a  set  of  function  symbols  and  a  set  of  predicate 
symbols  (as  usual,  constant  symbols  are  function  symbols 
of  arity  zero;  we  say  4>  is  function-free  if  it  does  not  contain 
any  function  symbols  of  arity  greater  than  zero).  Let  X  be 
a  set  of  object  and  truth  variables. 

We  define  object  terms  by  induction  as  follows.  An  ob¬ 
ject  term  is  an  object  variable  from  X  or  an  expression  of 
the  kind  f(t i, . . . ,  tk),  where  /  is  a  function  symbol  of  ar¬ 
ity  k  >  0  from  $  and  t\, . . . ,  tk  are  object  terms.  A  truth 
term  is  a  truth  value  from  TV  or  a  truth  variable  from  X. 
We  define  formulas  by  induction  as  follows.  If  p  is  a  pred¬ 
icate  symbol  of  arity  k  >  0  from  $  and  t\ , . . . ,  tk  are  ob¬ 
ject  terms,  then  p(ti  ,...,£*)  is  a  formula  (called  atom).  If 
F  and  G  are  formulas,  then  -» F,  (F  AG),  (FVG),  and 
(F  <-  G)  are  formulas.  If  F  is  a  formula  and  x  is  an  ob¬ 
ject  variable  from  X,  then  Vx  F  and  3x  F  are  formulas.  An 
n-valued  formula  is  an  expression  tv(F)  >  t,  where  F  is  a 
classical  formula  and  t  is  a  truth  term. 

An  interpretation  I  =  (I?,7r)  consists  of  a  nonempty 
set  D ,  called  domain ,  and  a  mapping  7r  that  assigns  to  each 
function  symbol  from  #  a  function  of  right  arity  over  D 
and  to  each  predicate  symbol  from  $  a  predicate  of  right 
arity  over  D.  A  variable  assignment  o  is  a  mapping  that 
assigns  to  each  object  variable  from  X  an  element  from  D 
and  to  each  truth  variable  from  X  a  truth  value  from  TV. 
For  an  object  variable  x  from  X  and  an  element  d  from 
D ,  we  write  o[x/d\  to  denote  the  variable  assignment  that 
is  identical  to  a  except  that  it  assigns  d  to  x  (for  a  truth 
variable  x  from  X  and  a  truth  value  c  from  TV,  the  notation 
o [x/c\  has  an  analogous  meaning).  The  variable  assignment 
a  is  by  induction  extended  to  all  object  and  truth  terms  by 
defining  <r(/(ti,...,  tk))  =  7r(/)(a(ti ),...,  <r(tk))  for  all 
object  terms  /(fi, . . . ,  tk)  and  <j(c)  =  c  for  all  truth  values 
c  from  TV.  The  truth  of  formulas  F  ini  under  a,  denoted 
I  F,  is  inductively  defined  as  follows: 

•/  Kr  P(ti,---,tk)  iff  (<r(*i),...,a(4))  e  t r(p). 

•  I  \=a  -*F  iff  not  I  \=a  F. 

•  I  K  (F  a  G)  iff  I  K  F  and  I  K  G. 


•  I  \=aVxF  iff  I  \=<r[x/d]  F  for  all  d  €  D. 

•  The  truth  of  the  remaining  formulas  in  I  under  a  is 
defined  by  expressing  V,  ,  and  3  in  terms  of  A, 
and  V  as  usual. 

A  formula  F  is  true  in  I,  or  I  is  a  model  of  F ,  denoted 
I  (=  F,  iff  F  is  true  in  I  under  all  variable  assignments  a. 

A  probabilistic  interpretation  (Pr ^interpretation)  Pr 
is  a  triple  (D,l,p),  where  D  is  a  nonempty  set  (called 
domain),  X  is  a  set  of  classical  interpretations  over  D 
(which  are  called  possible  worlds)  such  that  7 r*(/)  =  7 Tj(f) 
for  all  function  symbols  /  from  $  and  all  interpretations 
(£>,  7Tj),  (D,  7tj)  €  1,  and  p  is  a  mapping  from  X  to  the  set 
of  truth  values  TV  such  that  all  p(I)  with  I  €  X  sum  up 
to  1.  The  truth  value  Pra(F)  of  a  formula  F  in  the  Prn- 
interpretation  Pr  under  a  variable  assignment  a  is  defined 
as  follows  (we  write  Pr(F)  if  F  is  variable-free): 

Pr*(F)  =  E  (D 

An  n-valued  formula  tv(F)  >  t  is  true  in  Pr  under  o  iff 
Pra  (F)  >  a(t) .  An  n-valued  formula  P  is  true  in  Pr,  or 
Pr  is  a  model  of  P ,  denoted  Pr  |=  P,  iff  P  is  true  in  Pr 
under  all  variable  assignments  a.  Pr  is  a  model  of  a  set  of 
n-valued  formulas  V ,  denoted  Pr  (=  V,  iff  Pr  is  a  model 
of  all  n-valued  formulas  in  V.  V  is  satisfiable  iff  a  model  of 
V  exists.  P  is  a  logical  consequence  of  V,  denoted  V  |=  P, 
iff  each  model  of  V  is  also  a  model  of  P. 

For  an  n-valued  formula  tv(F)  >  c  with  a  truth  value 
c  from  TV  and  a  set  of  n-valued  formulas  V,  let  c  denote 
the  set  of  all  truth  values  Pra{F)  in  models  Pr  of  V  under 
variable  assignments  o.  It  is  easy  to  see  that  tv(F)  >  c  is 
a  logical  consequence  of  V  iff  c  <  min  c.  Hence,  we  get 
a  natural  notion  of  tightness  for  logical  consequences:  the 
n-valued  formula  tv(F)  >  c  is  a  tight  logical  consequence 
of  V ;  denoted  V  ^ tight  tv(F)  >  c,  iff  c  —  min  c. 

A  Herbrand  Prn-interpretation  (X,  p)  consists  of  a  set  X 
of  classical  Herbrand  interpretations  over  $  (that  is,  subsets 
of  the  Herbrand  base  HB$>  over  $)  and  a  mapping  p  from 
X  to  TV  such  that  all  p(I)  with  I  £  X  sum  up  to  1. 

Terms,  formulas,  n-valued  formulas,  and  sets  of  n- 
valued  formulas  are  ground  iff  they  do  not  contain  any  vari¬ 
ables.  The  notions  of  substitutions,  ground  substitutions, 
instances  of  formulas,  and  ground  instances  of  formulas  are 
defined  as  usual.  The  last  two  are  assumed  to  be  canonically 
extended  to  n-valued  formulas.  Finally,  we  also  adopt  the 
usual  conventions  to  eliminate  parentheses. 

2.2.  Many-valued  logic  programs 

We  now  introduce  probabilistic  many- valued  logic  pro¬ 
grams.  We  start  by  defining  many-valued  program  clauses, 
which  are  special  many- valued  formulas. 

An  n-valued  program  clause  is  an  n-valued  formula 
tv  (if  V  ->f?i  V  •  •  •  V  ->J3*)  >  c,  where  if,  Bi, . . . ,  £?*  with 
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k  >  0  are  atoms  and  c  is  a  truth  value  from  TV.  It  is  ab¬ 
breviated  by  (H  <-  Bi, . . . ,  Bk)[c ,  1].  Note  that  all  object 
variables  in  an  n-valued  program  clause  are  implicitly  uni¬ 
versally  quantified.  An  n-valued  logic  program  V  is  a  finite 
set  of  n-valued  program  clauses.  We  use  ground(V)  to  de¬ 
note  the  set  of  all  ground  instances  of  clauses  in  V. 

Many- valued  program  clauses  can  be  classified  into  facts 
and  rules:  facts  are  of  the  kind  ( H  «-  )[c,  1],  while  rules 
have  the  form  (H  «-  B\, . . . ,  Bk)[c ,  1]  with  k  >  0.  They 
can  also  be  divided  into  logical  and  purely  many-valued 
program  clauses:  logical  program  clauses  are  of  the  kind 
(H  <-  Biy...jBk)[l,l],  while  purely  many-valued  ones 
have  the  form  (H  «-  B\, . . . ,  Bk)[c ,  1]  with  c  <  1. 

Next,  we  introduce  many-valued  queries,  answer  substi¬ 
tutions,  and  answers.  An  n-valued  query  to  an  n-valued 
logic  program  V  is  an  expression  3(Ai, . . . ,  Ai)[t,  1], 
where  Ai, . . . ,  Ai  with  l  >  1  are  atoms  and  t  is  a  truth 
term.  An  n-valued  query  is  object-ground  iff  it  does  not 
contain  any  object  variables.  Given  an  n-valued  query 
Qc  =  3(Ai, . . . ,  Ai)[c,  1]  with  c  e  TV,  we  are  interested 
in  its  correct  answer  substitutions,  which  are  substitutions  0 
such  that  V  [=  tv((Ai  A- •  •  A  Ai)0)  >  c and  that  6  acts  only 
on  variables  in  Qc.  The  correct  answer  for  Qc  is  Yes  if  a 
correct  answer  substitution  exists  and  No  otherwise.  Given 
anti-valued  query  Qx  —  3 (Ax, . . . ,  Ai)[x,  1]  with  x  e  X, 
we  are  interested  in  its  tight  answer  substitutions,  which  are 
substitutions  9  such  that  V  | =  tight  tv((AiA-  •  *A Ai)9)  >  x9, 
that  6  acts  only  on  variables  in  Qx,  and  that  x9  is  a  truth 
value  from  TV.  Note  that  such  n-valued  queries  Qx  al¬ 
ways  have  a  tight  answer  substitution. 

Example  2.1  Let  n  =  101  and  let  the  n-valued  logic  pro¬ 
gram  V  contain  the  following  rules  and  facts  ( R ,  S,  and  T 
are  object  variables;  h,  a,  b,  and  o  are  con.«;ants): 

(re(R,S)<r-ro(R,S))[.  7, 1] 

(re(R,S)  ro(R,  5),  so(R,  S))[.9, 1] 

(i re(R ,  S)  <-  ro(R ,  S) ,  ad(R,  5))[1, 1] 

(re(R,S)  re(R,T),re(T,S))[  1,1] 

(ro(h,a)  <r-  )[1,1  ],(ad(h,a)  )[1,1] 

( ro(a,b )  <r-  )[1, 1],  (ad(a,  b)  <-  )[.8,1] 

(ro(b,  o)  <-  )[l,l],(so(6,  o)  <-  )[1,1] 

Then,  some  many-valued  queries  are  3(re(A,  o))[.99, 1], 
3 (re(h,  U))[.8, 1],  and  3 (re(h,  o))[X ,  1],  where  U  is  an  ob¬ 
ject  variable  and  X  is  a  truth  variable.  The  correct  answer 
for  3 (re(/i,  o))[.99, 1]  to  V  is  No,  whereas  the  correct  an¬ 
swer  for  3(re(ft,l/))[.8, 1]  to  V  is  Yes  (all  the  correct  an¬ 
swer  substitutions  for  3 (re(h,  [7))[.8,1]  to  V  are  given  by 
{U/a}  and  {U/b}).  Finally,  the  unique  tight  answer  substi¬ 
tution  for  3(re(A,  o))^,  1]  to  V  is  given  by  {X/.7}  . 

Like  classical  logic  programs,  many-valued  logic  pro¬ 
grams  have  the  nice  property  that  they  are  always  satisfiable 


[13].  Furthermore,  ground  many- valued  formulas  are  log¬ 
ically  entailed  in  Pr„-interpretations  iff  they  are  logically 
entailed  in  Herbrand  Prn -interpretations  [13]. 

In  the  sequel,  we  use  probabilistic  many-valued  logic 
programming  as  a  synonym  for  the  problem  of  deciding 
whether  Yes  is  the  correct  answer  for  a  given  ground  many¬ 
valued  query  to  a  many- valued  logic  program. 

2.3.  Computational  complexity 

We  now  analyze  the  computational  complexity  of  two 
decidable  special  cases  of  probabilistic  many- valued  logic 
programming.  The  first  one  is  a  generalization  of  proposi¬ 
tional  logic  programming,  while  the  second  one  generalizes 
the  decision  problem  that  defines  the  data  complexity  of  dat- 
alog.  These  two  special  cases  are  of  special  interest,  since 
their  classical  counterparts  have  the  nice  property  that  they 
are  P-complete  (see,  for  example,  [3]  for  a  survey). 

Crucially,  the  P-completeness  does  not  carry  over  to  the 
two  probabilistic  many-valued  generalizations: 

Theorem  2.2  a)  The  problem  of  deciding  whether  Yes 
is  the  correct  answer  for  a  ground  n-valued  query 
3(Ai, . . . ,  Ai)[c,  1]  to  a  ground  n-valued  logic  program  V 
is  co-NP -complete,  b)  Let  be  function-free.  Let  V  be 
a  fixed  n-valued  logic  program  and  let  T  be  a  varying  fi¬ 
nite  set  of  ground  logical  facts.  Let  V  U  T  contain  all  con¬ 
stant  symbols  from  <£.  The  problem  of  deciding  whether 
Yes  is  the  correct  answer  for  a  ground  n-valued  query 
3(Ai , . . . ,  Ai)[c,l]  to  V  U  J7  is  co-NP-complete. 

Hence,  restricted  deduction  problems  that  are  computa¬ 
tionally  tractable  for  classical  logic  programs  are  presum¬ 
ably  intractable  for  many-valued  logic  programs.  Thus,  any 
attempt  towards  efficient  probabilistic  many- valued  logic 
programming  should  be  guided  by  looking  for  efficient 
special-case,  average-case,  or  approximation  techniques. 

3.  Many-valued  logic  programming  in  Pr£ 

3.1.  Pr* -interpretations 

Probabilistic  many-valued  logic  programming  as  intro¬ 
duced  in  Section  2.2  has  a  well-defined  probabilistic  se¬ 
mantics.  However,  its  increased  computational  complexity 
compared  to  classical  logic  programming  is  quite  discour¬ 
aging  for  a  broad  use  in  practice,  especially  for  a  possible 
application  in  large  knowledge-base  systems. 

This  increase  in  complexity  seems  to  be  mainly  due  to 
the  probabilistic  semantics  in  its  full  generality.  In  fact, 
we  now  provide  a  truth-functional  approach  to  many- valued 
logic  programming  that  approximates  our  probabilistic  one 
and  that  is  less  computationally  complex.  The  main  idea  is 
to  focus  on  a  special  kind  of  Prn-interpretations: 
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A  Pv^-interpretation  is  a  Prn-interpretation  Pr  with 

Pra(A  A  B)  =  min (Prv(A),  Prv(P))  (2) 

for  all  variable  assignments  cr  and  all  atoms  A  and  B. 

Interestingly,  (2)  is  equivalently  expressed  as  follows. 

Theorem  3.1  Let  Pr  =  (D,X,  p)  be  a  Pxn-interpretation. 

It  holds  Pra(A  A  B)  =  min (Pra(A),  Pra(B))  for  all 
variable  assignments  o  and  all  atoms  A  and  B  iff  all  the 
interpretations  I  E  1  with  p(I)  >  0  can  be  written  in  a 
sequence  (D,7Ti),  . . . ,  (D,7r*)  such  that  for  all  predicate 
symbols p from  7Ti(p)  D  ^{p)  2  *  * '  2  Kk(p)- 

A  Pr n~model  of  a  set  of  n-valued  formulas  V  is  a  Pr*- 
interpretation  that  is  a  model  of  V.  The  set  of  n-valued 
formulas  V  is  satisfiable  in  Pr*  iff  a  Pr* -model  of  V  exists. 
The  n-valued  formula  P  is  a  logical  consequence  in  Pr*  of 
V  iff  each  Pr* -model  of  V  is  also  a  model  of  P.  The  n~ 
valued  formula  tv(F)  >  c  is  a  tight  logical  consequence  in 
Pr*  of  V  iff  c  is  the  minimum  of  all  truth  values  Pra  ( F )  in 
Pr* -models  Pr  of  V  under  variable  assignments  cr. 

The  next  theorem  shows  that  tight  logical  consequences 
in  Pr*  approximate  logical  and  tight  logical  consequences 
in  Prn.  In  particular,  for  many-valued  logic  programs  V  and 
formulas  P,  this  theorem  shows  that  V  \=tight  tv(F)  >  0  in 
Pr*  immediately  entails  V  \=  tight  tv(P)  ^  0  in  Prn. 

Theorem  3.2  Let  T  be  a  set  of  n-valued formulas ,  let  F  be 
a  formula,  and  letc  E  TV.  If  T  \=tight  tv(-^)  ^  c  *n 
then  all  truth  values  d  €  TV  with  T  |=  tv(F)  >  d  in  Prn 
are  contained  in  {0, . . . ,  c}  C  TV. 

3.2.  Comparison  with  Ln -interpretations 

We  now  focus  on  the  relationship  between  Pr* -interpre¬ 
tations  and  interpretations  in  L„.  We  first  define  Ln -inter¬ 
pretations  and  the  truth  value  of  classical  formulas  in  Ln- 
interpretations  under  variable  assignments. 

An  ^-interpretation  L  =  (D,  7r)  consists  of  a  non¬ 
empty  domain  D  and  a  mapping  n  that  assigns  to  each  k- 
ary  function  symbol  from  $  a  mapping  from  Dk  to  D  and 
to  each  A;-ary  predicate  symbol  from  a  mapping  from  Dk 
to  the  set  of  truth  values  TV.  The  truth  value  La(F)  of  a 
formula  F  in  the  Ln -interpretation  L  under  a  variable  as¬ 
signment  cr  is  inductively  defined  by: 

•  L„{p{tu . .  .,<*))  =  7r(p)(<r(*i), . . . ,  a (t*)). 

•  La(-iF)  =  1  -La{F). 

•  La{F  AG)  =  min  (L„(F),La(G)). 

•  L„(FV  G)  =  max(I„  (F),L„  (G) ) . 

•  L„(F  <-  G)  =  min(l,  La(F)  -  L^G)  +  1). 

•  La(Vx  F)  =  min{I<r[l/d](F)  |  d  €  D}. 

•  La(3xF )  =  m&x{La[x/tj](F)\d  e  D}. 


We  next  show  that  for  logical  combinations  of  certain 
formulas,  the  truth  value  in  Pr* -interpretations  under  vari¬ 
able  assignments  is  defined  like  the  truth  value  in  Ln- 
interpretations  under  variable  assignments. 

Lemma  3.3  Let  Pr  =  {D,l,p)  be  a  Px^-interpretation 
and  let  o  be  a  variable  assignment.  For  all  object  variables 
x  6  X,  all  formulas  F,  and  all  formulas  G  and  H  that  are 
built  without  the  logical  connectives  and  <r-: 

Pra{pF)  =  1  -Prv(F)  (3) 

Pra(G  AH)  =  min (Pra(G),Pra(H))  (4) 

Pra(GVH)  =  ma x(Pra{G),Pra(H))  (5) 

Pra(G  <-  H)  =  min(l,  Prff(G)  -  Pra(H)  +  1)  (6) 

Prv(VzG)  =  min{Pra[a./d](G)  \d  E  D)  (7) 

Pra(3xG)  =  max{Pr<7[a./d](Gf)  |  d  E  D}  .  (8) 

This  means  that  Pr*-  and  L„-interpretations  give  the 
same  truth  value  to  all  formulas  built  without  the  logical 
connectives  -i  and  and  to  all  logical  combinations  of 
these  formulas  (thus,  also  to  classical  program  clauses): 

Theorem  3.4  Let  Pr  be  a  Px^-interpretation ,  let  L  be  an 
Ln-interpretation ,  and  let  a  be  a  variable  assignment.  If 
Pra{A)  =  La(A)  for  all  atoms  A ,  then  Prc(G)  =  La(G), 
Pra(-iG)  =  La(-^G),  and  Pra(G  <—  H )  =  La{G  <—  H) 
for  all  formulas  G  and  H  built  without  -i  and 

Note  that  there  also  exist  formulas  with  different  truth 
values  in  Pr* -interpretations  and  in  Ln-interpretations: 

Theorem  3.5  There  are  Px^-interpretations  Pr,  Ln-inter¬ 
pretations  L,  variable  assignments  o,  and  formulas  G  with 
Pra{A)  =  La(A)  for  all  atoms  A  and  Pra(G)  ■£  La(G). 

This  last  theorem  is  not  surprising,  since  Pr* -interpreta¬ 
tions  still  satisfy  the  axioms  of  probability.  That  is,  Pr* -in¬ 
terpretations  always  give  the  same  truth  value  to  formulas 
that  are  logically  equivalent  in  the  classical  sense.  Ln-inter¬ 
pretations,  in  contrast,  do  not  have  this  property. 

3.3.  Many- valued  logic  programs 

We  keep  the  definitions  of  many-valued  program  clauses 
and  many-valued  programs  from  Section  2.2.  In  particu¬ 
lar,  the  semantics  of  many- valued  program  clauses  in  Pr* - 
interpretations  is  already  given  by  the  semantics  of  many¬ 
valued  formulas  in  Prn-interpretations.  The  truth  of  many¬ 
valued  program  clauses  in  Pr* -interpretations  is  then  addi¬ 
tionally  characterized  as  follows. 

Lemma  3.6  For  all  Pr* -interpretations  Pr  =  (D,l,p), 
all  variable  assignments  a,  and  all  n-valued  program 
clauses  ( H  «-  B\, . . .  ,Fjk)[c,  1]«* 

(H  «-  Bi , . . . ,  Bk)[c,  1]  is  true  in  Pr  under  a  iff 

Pra(H)  >  c  -  1  +  min(Prv(Pi), . . . ,  Pr^P*)) . 
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Given  an  n- valued  query  Qc  =  3(Ai , . . . ,  Aj)[c,  1]  with 
c  E  TV,  we  are  interested  in  its  correct  answer  substi¬ 
tutions  in  Pr*,  which  are  substitutions  9  such  that  V  |= 
t\i(Ai9  A  •  *  ■  A  Ai6)  >  c  in  Pr*  and  that  0  acts  only  on 
variables  in  Qc .  The  correct  answer  in  Pr*  for  Qc  is  Yes 
if  a  correct  answer  substitution  in  Pr*  exists  and  No  other¬ 
wise.  Given  an  n-valued  query  Qx  =  3(Ai , . . . ,  Ai)[x,  1] 
with  x  E  X,  we  are  interested  in  its  tight  answer  substitu¬ 
tions  in  Pr* ,  which  are  substitutions  9  such  that  V  f -tight 
tv(A\9  A  •  *  •  A  Ai9)  >  x9  in  Pr*,  that  9  acts  only  on  vari¬ 
ables  in  Qx ,  and  that  x9  is  a  truth  value  from  TV . 

Example  3.7  Let  n  =  101  and  let  V  be  the  n- valued  logic 
program  from  Example  2.1.  The  correct  answer  in  Pr*  for 
the  n-valued  query  3 (re(h,  o))[.99, 1]  to  V  is  No,  whereas 
the  correct  answer  in  Pr*  for  3 (re(/&,  L/))[.8, 1]  to  V  is  Yes 
(note  that  all  the  correct  answer  substitutions  in  Pr*  for 
3(re(A,P))[.8,l]  to  V  are  given  by  {U/a},  {U/b},  and 
{  Uj o}).  Finally,  the  unique  tight  answer  substitution  in  Pr* 
for  3 (re(h,  o))[X,  1]  to  V  is  given  by  {X/.8}  . 

Note  that  many- valued  logic  programs  are  always  satisfi- 
able  in  Pr*  [13].  Moreover,  ground  many-valued  formulas 
are  logically  entailed  in  Pr* -interpretations  iff  they  are  log¬ 
ically  entailed  in  Herbrand  Pr* -interpretations  [13]. 

In  the  sequel,  we  use  many-valued  logic  programming  in 
Pr*  as  a  synonym  for  the  problem  of  deciding  whether  Yes 
is  the  correct  answer  in  Pr*  for  a  given  ground  many- valued 
query  to  a  many- valued  logic  program. 

3.4.  Model  and  fixpoint  semantics 

We  briefly  discuss  the  model  and  fixpoint  semantics  of 
many- valued  logic  programs  in  Pr*  [12].  In  the  sequel,  let 
V  be  an  n- valued  logic  program. 

We  focus  on  Herbrand  Pr* -interpretations,  which  we 
identify  with  fuzzy  sets.  In  detail,  each  Herbrand  Pr* -inter¬ 
pretation  (1,  p)  is  identified  with  the  fuzzy  set  I :  HB<$ 

TV,  where  I[A]>  for  all  A  E  HB$,  is  the  sum  of  all 
p(I)  with  I  e  1  and  I  \=  A.  We  subsequently  use  bold 
symbols  to  denote  such  fuzzy  sets.  The  fuzzy  sets  0  and 
HB$  are  defined  by  0[A]  =  0  and  HB$[A]  =  1  for  all 
A  E  HB$.  Finally,  we  define  the  intersection,  the  union, 
and  the  subset  relation  for  fuzzy  sets  Si  and  S2  as  usual  by 
Si  H  S2  =  min(Si,S2),  Si  U  S2  =  max(Si,S2),  and 
Si  C  S2  iff  Si  =  Si  n  S2,  respectively. 

We  define  the  immediate  consequence  operator  Tp  as 
follows.  For  all  I  C  HB$  and  H  E  HB$  : 

TV(I)[H ]  =  max({c  -  1  +  min(/[Si], . . . , I[Bk])  | 
(H<-Bi,...,Bk)[c,l]  E  ground(V)}  U  {0}) . 

Note  that  we  define  min(/[£i], . . . ,  /[#*])  =  1  for  k  =  0. 

For  all  I  C  HB&,  we  define  Tpfuj(I)  as  the  union 
of  all  Tpfl(I)  with  l  <  u,  where  T'p tO(J)  =  I  and 


Tvt  (1+ 1)(/)  =  Tv(Tptl(I))  for  all  l  <  u>.  Finally,  we 
abbreviate  Tp-f  a(0)  by 

The  model  and  fixpoint  semantics  of  many- valued  logic 
programs  in  Pr*  is  now  expressed  as  follows. 

Theorem  3.8 

{){I\ICHB*,  I|=P}  =  lfp(Tv )  =  Tptu,. 

Thus,  tight  answer  substitutions  for  object- ground  many¬ 
valued  queries  can  be  characterized  as  follows. 

Theorem  3.9  Let  V  be  an  n-valued  logic  program  and  let 
3(^4i, . . . ,  A{)[x ,  1]  be  an  object-ground  n-valued  query 
with  x  €  X.  The  tight  answer  substitution  in  Pr*  for 
3(^4i, . . . ,  Ai)[x,  1]  to  V  is  given  by  { x/c },  where  c  is  the 
minimum  of  all  Tpfu[Ai]  with  i  E  [1 :/]. 

3.5.  Proof  theory 

We  now  present  SLDPr* -resolution  for  many-valued 
logic  programs  in  Pr* ,  which  is  an  extension  of  the  classical 
SLD-resolution  (see,  for  example,  [1]).  In  the  sequel,  many¬ 
valued  facts  (A  <-  )[c,  1]  are  abbreviated  by  {A)[c,  1]. 

A  subgoal  list  is  a  finite  list  (Ai)[ai,  1] . . .  (Am)[om,  1] 
of  n-valued  facts  (Ai)[ai,  1], . . . ,  (Am)[am,  1]  such  that 
a>i  >  *  •  *  >  Q>m  >  0  and  m  >  0.  A  substitution  9  is  applied 
to  a  subgoal  list  by  replacing  each  contained  atom  Ai  by 
Ai0.  For  n-valued  program  clauses  Pi  and  P2»  we  say  Pi 
is  a  variant  of  P2  iff  Pi  is  an  instance  of  P2  and  P2  is  an 
instance  of  Pi.  The  notions  of  unifiers  and  most  general 
unifiers  (mgu)  are  defined  as  usual. 

The  subgoal  list  (a(Bi)[b,  1] . . .  ( Bk)[b ,  l]u ))9  is  a  re¬ 
solvent  of  the  subgoal  list  a  (A) [a,  l]u;  and  the  n-valued 
program  clause  (H<-Bi, . . . ,  P*)[c,  1]  with  mgu  0  iff  A 
and  H  unify  with  mgu  9,  a  <  c,  and  b  =  a  —  c  +  1. 

Note  that,  for  subgoal  lists  a(A)[a,  l]u  and  n-valued 
program  clauses  ( H  «-JE?i , . . . ,  Bk )  [c,  1] ,  the  resolvent 
(a(Bi)[b,  1] . . .  (£*)[&,  1  }uj)9  is  a  subgoal  list,  since  0  <  a 
<  c  <  1  and  b  =  a  —  c  +  1  entails  0  <  b  <  1. 

An  SLDFv^-derivation  of  a  subgoal  list  Ro  from  an 
n-valued  logic  program  V  is  a  maximal  sequence  Ro, 
(C'o,  0o),  Ri  >  (Cu9i), . . . ,  where  ifo,  jRi,  . . .  is  a  sequence 
of  subgoal  lists,  Cb,Ci,...  is  a  sequence  of  variants  of 
clauses  from  V,  and  90,9i,. ..  is  a  sequence  of  substitu¬ 
tions  such  that  Ri+i  is  a  resolvent  of  Ri  and  C{  with  mgu 
0i  and  such  that  Ci  does  not  have  any  variables  in  common 
with  /Jo,  Co,...,  Ri- 1  ■  If  a  subgoal  list  Rj  is  empty,  then  it 
is  the  last  one  in  a  derivation.  Such  an  SLDPr* -derivation 
is  called  successful 

The  presented  SLDPr*-resolution  is  a  sound  and  com¬ 
plete  technique  for  correct  query  answering  in  Pr* .  That  is, 
for  n-valued  logic  programs  V  and  n-valued  queries  Qc  — 
3(Ai, . . . ,  Aj)[c,l]  with  c  >  0,  the  correct  answer  in  Pr* 
for  Qc  to  V  is  Yes  iff  a  successful  SLDPr*-derivation  of 
(Ai)[c,  1] . . .  ( Ai)[c ,  1]  from  P  exists.  Moreover,  each  suc¬ 
cessful  SLDPr* -derivation  of  (Ai)[c,  1] . . .  (A/)[c,  1]  from 
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V  with  the  sequence  of  substitutions  0O>  >  •  •  •  ?  Oj  provides 

a  correct  answer  substitution  in  Pr*  for  Qc  to  V  by  the  sub¬ 
stitution  0O 0i . . .  9j  restricted  to  the  variables  in  Qc. 

More  precisely,  the  soundness  and  the  completeness  of 
SLDPr* -resolution  is  expressed  as  follows. 

Theorem  3.10  a)  Let  V  be  an  n-valued  logic  program 
and  Qc  =  3(Ai, . . . ,  A{)[c,  1]  be  an  n-valued  query  with 
c>  0.  If  there  exists  a  successful  SLDPr^-derivation  of 
(Ai)[c,  1] . . .  ( Ai)[c ,  1  \from  V  with  the  sequence  of  substi¬ 
tutions  00, 0i , ... ,  Oj,  then  the  substitution  0o0i . . .  0j  re¬ 
stricted  to  the  variables  in  Qc  is  a  correct  answer  sub¬ 
stitution  in  Pr*  for  Qc  to  V.  b)  Let  V  be  an  n-valued 
logic  program  and  Qc  =  3(i4i, . . .  ,i4j)[c,l]  be  an  n- 
valued  query  with  c  >  0.  If  Yes  is  the  correct  answer  in 
Pr*  for  Qc  to  V ,  then  a  successful  SLDPr^-derivation  of 
(Ai)[c,  1] . . .  (Aj)[c,  1  \from  V  exists. 

3.6.  Computational  complexity 

We  now  focus  on  the  computational  complexity  of  many¬ 
valued  logic  programming  in  Pr* .  Like  in  Section  2.3,  we 
concentrate  on  the  two  decidable  special  cases  that  general¬ 
ize  propositional  logic  programming  and  the  decision  prob¬ 
lem  that  defines  the  data  complexity  of  datalog.  Crucially, 
in  contrast  to  the  probabilistic  many-valued  generalizations, 
the  truth-functional  ones  are  P-complete. 

Theorem  3.11  a)  The  optimization  problem  of  computing 
the  tight  answer  substitution  in  Pr ^  for  an  object- ground  n- 
valued  query  3{A\, . . . ,  At)[x,  1],  with  x  G  X,  to  a  ground 
n-valued  logic  program  V  is  P-complete.  b)  Let  $  be 
function-free.  Let  V  be  a  fixed  n-valued  logic  program ,  let 
T  be  a  varying  finite  set  of  ground  n-valued  facts.  Let  VU  T 
contain  all  constant  symbols  from  4>.  The  optimization 
problem  of  computing  the  tight  answer  substitution  in  Pr* 
for  an  object-ground  n-valued  query  3(Ai , . . . ,  Ai)[x,  1], 
with  x  G  X,  to  V  U  T  is  P-complete. 

4.  Summary  and  conclusion 

We  introduced  probabilistic  many-valued  logic  programs 
in  which  the  implication  connective  is  interpreted  as  mate¬ 
rial  implication.  We  showed  that  probabilistic  many-valued 
logic  programming  is  computationally  more  complex  than 
classical  logic  programming.  We  then  focused  on  the  ap¬ 
proximation  of  probabilistic  many-valued  logic  program¬ 
ming  by  many- valued  logic  programming  in  Pr*.  In  par¬ 
ticular,  we  introduced  a  sound  and  complete  proof  theory 
for  many-valued  logic  programming  in  Pr* . 

Crucially,  many-valued  logic  programs  in  Pr*  have  both 
a  probabilistic  semantics  in  probabilities  over  a  set  of  pos¬ 
sible  worlds  and  a  truth-functional  semantics  in  the  finite¬ 
valued  Lukasiewicz  logics  Ln.  Furthermore,  many- valued 
logic  programming  in  Pr*  has  a  model  and  fixpoint  charac¬ 
terization,  a  proof  theory,  and  computational  properties  that 


are  very  similar  to  those  of  classical  logic  programming. 
Hence,  it  is  well  worth  being  studied  more  deeply. 

Finally,  this  paper  showed  how  presumably  intractable 
probabilistic  deduction  problems  in  artificial  intelligence 
can  be  tackled  by  efficient  approximation  techniques  based 
on  truth-functional  many-valued  logics. 
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Abstract 

In  this  paper  we  present  a  method  for  automated  theorem  prov¬ 
ing  in  non-classical  logics  having  as  algebraic  models  bounded 
distributive  lattices  with  certain  types  of  operators.  The  idea  is 
to  use  a  Priestley-style  representation  for  distributive  lattices  with 
operators  in  order  to  define  a  class  of  Kripke-style  models  with 
respect  to  which  the  logic  is  sound  and  complete.  If  this  class  of 
Kripke-style  models  is  elementary,  it  can  then  be  used  for  a  trans¬ 
lation  to  clause  form;  satisfiability  of  the  resulting  clauses  can  be 
checked  by  resolution.  We  illustrate  the  ideas  by  several  examples. 


1  Introduction 

Efficient  reasoning  on  incomplete,  vague  and  imprecise 
knowledge  requires  the  development  of  efficient  many* va¬ 
lued  theorem  provers.  Since  many  non-classical  logics 
that  occur  in  a  natural  way  in  practical  applications  can  be 
proved  to  be  sound  and  complete  with  respect  to  certain 
classes  of  distributive  lattices  with  operators,  in  this  paper 
we  will  focus  on  this  kind  of  logics.  One  of  the  main  advan¬ 
tages  of  distributive  lattices  (with  well-behaved  operators) 
is  that  they  usually  have  good  and  economical  representa¬ 
tion  theorems.  The  method  for  automated  theorem  proving 
presented  here  uses  the  Priestley  representation  for  distribu¬ 
tive  lattices  with  operators.  We  illustrate  the  main  ideas  by 
several  examples,  one  of  which  is  presented  in  detail. 

The  main  contributions  of  this  paper  are  the  following: 

•  A  link  between  algebraic  and  Kripke-style  models 
for  logics  based  on  distributive  lattices  with  opera¬ 
tors,  established  by  using  Priestley-style  representa¬ 
tion  theorems.  This  extends  the  results  in  [21],  and 
makes  more  precise  some  results  from  [24]. 

•  A  method  for  translation  to  clause  form  in  logics 
based  on  distributive  lattices  with  operators. 

•  Decidability  results  for  some  of  the  logics  considered. 

Our  work  is  inspired  by  the  work  of  Baaz  and  Fermuller 
[1]  on  many-valued  resolution,  by  the  results  of  Hahnle 


[7,  8],  and  by  our  previous  work  on  finitely-valued  logics 
[21]  on  the  one  hand;  and  by  the  results  of  Goldblatt  [5]  on 
Priestley-type  dualities  for  distributive  algebras  with  opera¬ 
tors  on  the  other  hand.  The  method  presented  in  this  paper 
can  be  seen  as  a  common  framework  which  encompasses 
existing  results  in  automated  theorem  proving  in  both  regu¬ 
lar  finitely- valued  logics,  and  modal  logics. 

The  paper  is  structured  as  follows.  In  Section  2  we 
give  some  background  information.  In  Section  3  we  briefly 
present  a  Priestley  representation  for  distributive  lattices 
with  operators,  and  its  use  in  establishing  links  between  al¬ 
gebraic  and  Kripke-style  models.  In  Section  4  we  describe 
a  method  for  translation  to  clause  form  and  automated  theo¬ 
rem  proving.  In  Section  5  we  give  several  examples,  one  of 
which  is  presented  in  detail.  We  end  by  mentioning  related 
approaches  and  plans  for  future  work. 

2  Preliminaries 

Algebra.  In  what  follows  we  assume  known  standard 
notions,  such  as  partially-ordered  set,  lattice,  order-filter 
and  order-ideal  in  partially-ordered  sets,  meet-  and  join- 
irreducible  elements,  and  (prime)  filter  and  (prime)  ideal  in 
lattices.  For  definitions  and  further  information  we  refer  to 
[3].  Let  (X,  <)  be  a  partially  ordered  set  and  let  R  C  Xn+1 
be  a  (n  4-  l)-ary  relation  on  X .  R  is  an  increasing  relation  if 
it  has  the  property  that  for  all  x  £  Xn  and  every  y,z  £  X, 
if  R(x,  y)  and  y  <  z  then  R(x,  z).  R  is  a  decreasing  rela¬ 
tion  if  for  all  x  £  Xn  and  every  y,z  £  X,  if  R(x,  y)  and 
z  <  y  then  R(x>  z).  Given  a  partially-ordered  set  ( X ,  <), 
0(X)  will  denote  the  set  of  order-filters  of  X.  For  the  basic 
notions  of  universal  algebra  needed  in  what  follows,  such  as 
satisfiability  of  equations  in  an  algebra  or  a  class  of  algebras 

a  a 

(notation:  A  t\  =  t2  resp.  K  |=  h  -  t2)  and  the  defini¬ 
tion  of  a  variety,  we  refer  to  [2].  The  variety  generated  by  a 
class  K  of  algebras  will  be  denoted  HSP(K). 

Logic.  Roughly  speaking,  a  logic  is  described  by  a  pair 
C  =  (Fma (£),  \~c ),  where  Fma(£)  is  the  set  of  formulae 
in  the  logic  and  I ~c  is  a  relation  between  sets  of  formulae 
and  formulae  called  provability  relation  (for  simplicity  we 
only  consider  here  the  propositional  case).  A  formula  <f>  is  a 


0-7695-0161-3/99  $10.00  ©  1999  IEEE 


242 


theorem  of  C  (denoted  I ~c  <l>)  if  0  I ~c  <t>-  We  call  a  logic  C 
sound  w.r.t.  a  class  K.  of  models  if  every  theorem  <p  of  C  is 
true  in  all  models  in  X:  we  call  C  complete  w.r.t.  K.  if  every 
formula  0  that  is  true  in  all  models  in  K  is  a  theorem  of  C. 

3  Priestley  representation 

We  first  present  some  results  on  Priestley  representation  for 
distributive  lattices  with  operators  (abbreviated  in  what  fol¬ 
lows  dlo).  For  details  we  refer  to  [3, 5, 21, 24], 

The  Priestley  representation  theorem  [17]  states  that  ev¬ 
ery  bounded  distributive  lattice  A  is  isomorphic  to  the  lat¬ 
tice  of  clopen  (i.e.  closed  and  open)  order  filters  of  the  or¬ 
dered  topological  space  having  as  points  the  prime  filters 
of  A,  ordered  by  inclusion,  and  the  topology  generated  by 
the  sets  of  the  form  Xa  =  {F  \  F  prime  filter,  a  £  F}  and 
their  complements  as  a  subbasis.  We  denote  the  partially  or¬ 
dered  set  of  all  prime  filters  of  A,  ordered  by  inclusion,  and 
endowed  with  the  topology  mentioned  above,  by  D(A)  (we 
will  refer  to  it  as  the  dual  of  A).  The  lattice  of  clopen  order 
filters  of  an  ordered  topological  space  X  =  {X,  <,r)  will 
be  denoted  by  E{X).  The  Priestley  representation  theorem 
states  that  every  bounded  distributive  lattice  A  is  isomor¬ 
phic  to  E(D(A)). 

In  this  paper  we  will  also  consider  other  classes  of  oper¬ 
ators  on  bounded  distributive  lattices,  besides  V  and  A,  as 
explained  in  what  follows.  Let  £  be  a  signature  contain¬ 
ing  function  symbols  in  several  classes.  In  order  to  distin¬ 
guish  these  classes,  we  will  write  £  =  LhU  La  U  JhU 
MhUJaUMa,  where  Lh,  La,  Jh, Mh,  Ja,  and  Ma  may 
be  empty.  Let  DLOr  be  the  class  of  all  bounded  distributive 
lattices  with  operators  in  £,  such  that  for  every  A  £  DLOr 
die  operators  in  the  classes  Lh,  La,  Jh,  Mh,  Ja,  and  Ma 
are  respectively  lattice  morphisms  and  antimorphisms,  join 
and  meet  hemimorphisms  (maps  of  arbitrary  arity,  preserv¬ 
ing  finite  joins  resp.  meets  in  every  argument),  and  join  resp. 
meet  hemiantimorphisms  (maps  of  arbitrary  arity,  mapping 
finite  meets  to  joins  (resp.  joins  to  meets)  in  every  argu¬ 
ment).  The  main  fact  proved  in  [24]  is  that  the  additional 
operators  in  £  on  an  algebra  A  £  DLOs  induce  functions 
resp.  relations  on  D(A),  which,  again,  define  correspond¬ 
ing  operators  on  both  the  set  E(D(A))  of  clopen  order- 
filters  of  D(A)  and  on  the  set  0{D(A))  of  order-filters 
of  D(A).  For  details  on  a  Priestley  duality  theorem  for 
DLO  we  refer  to  [24].  The  main  result  we  will  use  here 
is  that  for  every  A  £  DLOs,  there  exists  an  isomorphism  of 
{0, 1,  V,  A}  U  S-algebras  t/a  ■  A  ~  E(D(A)).  Note  that  if 
A  is  finite  then  the  Priestley  topology  on  D(A)  is  discrete, 
hence  E{D(A))  =  0{D{A)). 

In  [21, 23]  we  used  this  Priestley  representation  for  DLO 
to  give  an  automated  theorem  proving  procedure  for  certain 
finitely-valued  logics.  In  this  paper  we  show  that  the  ideas 
are  much  more  general,  and  can  also  be  applied,  with  minor 
modifications,  to  wider  classes  of  non-classical  logics. 


3.1  Kripke-style  models 

In  [24],  inspired  by  the  definition  of  the  relational  Priestley 
spaces,  we  introduced  ordered  relational  structures. 

Definition  1  An  ordered  E-relational  structure  is  a  par¬ 
tially  ordered  set  endowed  with  maps  and  relations 

(X,  <){#x}HeihULai  { Rx}R€Jhl)Ja,{Qx}Q€MhUMa ) 

where  for  every  H  £  Lh,  Hx  :  X  ->  X  is  order¬ 
preserving,  for  every  K  £  La,  Kx  '■  X  — >  X  is  order- 
reversing, f&r  every  R  £  JhUJa  with  arity  n,  Rx  C  Xn+1 
is  an  increasing  relation,  and  for  every  Q  £  MhUMa  with 
arity  n,  Qx  C  Xn+1  is  a  decreasing  relation. 

The  class  of  all  ordered  S-relational  structures  will  be  de¬ 
noted  ORSr-  For  every  ordered  S-relational  structure  X,  its 
set  O(X)  of  order-filters  can  be  endowed  with  a  bounded 
lattice  structure  (where  join  is  union,  meet  is  intersection, 
0  =  0  and  1  =  X )  and  with  additional  operators  which  can 
be  defined  in  a  canonical  way  starting  from  the  maps  and 
relations  on  X  as  shown  in  what  follows. 

Theorem  1  ([5, 24])  Let  X  £  ORSr.  The  following  hold. 

1.  Every  order  preserving  (resp.  order  reversing)  oper¬ 
ation  on  X,  H  £  Lh  (resp.  K  £  La)  induces  a 
lattice  morphism  (resp.  lattice  antimorphism)  hn  : 
O(X)  O(X),  (resp.  kK  :  0(X)  ->  O(X))  de¬ 
fined  for  every  U  6  0(X)  by  hii{U)  =  Hf^ifJ), 
(resp.  kK(U)  =  X\KxHU)). 

2.  Every  increasing  relation  Rx  Q  Xn+1}  R  G  Jh 
(resp.  Rrx  C  Xn+x,  with  R*  G  Ja),  induces  a  join 
hemimorphism  Jr  :  0(X)n  -4  O(X)  (resp.  join 
hemiantimorphism  Jr<  :  0(X)n  -4  O(X)),  defined 
for  every  Uu...,Un  €  0(X)  by 

( Jh )  fR(Uu. . . ,  Un)  =  £  X  |  3xi  6  Uu . . .  ,xn  €  Un  • 

Rx  (®1  >  *  •  ♦  »  •£)}■) 

(Ja)  fRi(Ul,...,Un)  =  {x  e  X  I  3xi  &  Ul,...,Xn  i  Un  : 

Every  decreasing  relation  Qx  Q  Xn+1,  Q  €  Mh 
(resp.  Qfx  C  Xn+1,  with  Q*  G  Ma),  induces  a  meet 
hemimorphism  qq  :  0(X)n  O(X)  (resp.  meet 
hemiantimorphism  qq>  :  0(X)n  -4  0(X)),  defined 
for  every  Ui, . . . ,  Un  €  0(X)  by 

(Mh)  Qq(U\)  •  •  *  »  Un)  =  {x  €  X  |  Vxi,  .  .  .,Xn(Qx(x  1, 

®nj®)  ^  3i)Xi  G  £/*)}■> 

(Ma)  Qq*  (Uly .  .  .  ,  Un)  =  {x  G  X  I  Vxi, .  .  .  ,  Xn(Qx  (®1)  *  •  *  7 

•Cnj  “4  3 iyXi  U%)"^. 

3.  A  Hey  ting  algebra  structure  can  be  defined  on  0(X) 
by  defining  for  every  U\,U2  €  O(X) 

Ui  =>  U2  —  €  X  |  Vy,  x  <  y  if  y  G  Ui  then  y  G  ft} 

— «C7i  =  {x  G  X  |  Vy,  x  <  y  implies  y  &  Ui} 
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In  what  follows  we  consider  Kripke-style  models  for  log¬ 
ics  L  with  logical  connectives1  in  {V,  A}  U  E,  whose  for¬ 
mulae  are  built  from  propositional  variables  taken  from  a 
(countable)  set  Var,  and  satisfying  the  following  condition: 

(L)  C  is  sound  and  complete  w.r.t.  a  subvariety  Vc  of 
DLOe,  such  that  the  Priestley  duality  induces  a  dual 
equivalence  between  Vc  and  a  suitable  class  Vc$P  of 
relational  Priestley  spaces. 

Definition  2  Kc  is  the  subclass  of  ORSz  consisting  of  all 
relational  structures  that  satisfy  all  the  conditions  not  in¬ 
volving  the  topology  which  characterize  the  class  V^Sp. 

The  Kripke  models  for  C  we  consider  here  are  relational 
structures2  (X,  <,{Rx}Rez)  in  Kc,  endowed  with  a 
hereditary  meaning  function  m  :  Var  -4  0(1 f). 

Definition  3  Let  X  G  ORSs,  let  m  :  Var  -4  0(X)  be 
a  meaning  function ,  m  :  Fma(£)  -4  O(X)  its  unique 
extension  to  a  morphism  of  algebras,  let  x  G  X,  and  let 
<t>  G  Fma(£).  We  define: 

(V  X  (=m>a.  4>iffxe  m(<t>); 

(2) X^m<j>iffm(4>)  =  X; 

(3)  X  | =  <j>  iff  X  \=m  <j>  for  every  m  :  Var  -4  0(X ). 

Lemma  2  For  any  X  G  ORSr,  X  f=  <f>iffO(X)  £  0  =  1. 

Theorem  3  Let  Che  a  logic  that  satisfies  condition  (L).  If 
for  every  X  G  Kc *  O(X)  G  Vc*  then  C  is  sound  and  com¬ 
plete  w.r.t.  the  class  Kc •  Assume  that  Vc  —  HSP(A)  and 
let  Ka  be  the  class  of  all  ordered  relational  structures  sat¬ 
isfying  all  non-topological  conditions  corresponding  to  the 
Priestley  duals  of  the  elements  of  A.  If  for  every  X  G  Km 4, 
O(X)  G  Vc*  then  L  is  sound  and  complete  w.r.t.  Ka ■ 

Proof :  Without  loss  of  generality  we  only  prove  the  sec¬ 
ond  part  of  the  theorem.  Let  0  be  a  theorem  of  C.  Then 

a 

Vc  \=  <t>  =  1.  Since,  by  assumption,  for  every  X  €  K. 4, 
O(X)  e  Vc,  it  follows  that  X  |=  <f>  for  every  X  €  1C  a- 

r 

Conversely,  assume  that  for  every  X  G  Ka,  X  f=  <p.  Let 
A  G  A  be  an  algebra,  and  /  :  Var  -4  A  an  arbitrary  valua¬ 
tion.  By  the  Priestley  duality  for  Vc,  there  exists  an  isomor¬ 
phism  rjA  :  A  ~  E(D(A)).  Let  i :  E(D(A ))  -4  0(D(A)) 
be  the  inclusion,  and  let  m  :  Var  -4  0(D(A))  be  defined  by 
m  —  torjAof.  It  is  easy  to  see  that  m  =  oorjAof.  We  know 

1The  connectives  in  £  correspond  to  versions  of  non-classical  nega¬ 
tion,  or  are  generalizations  of  operators  from  modal  logic  such  as  those 
modeling  knowledge,  belief,  necessity,  or  possibility. 

2  Further  considerations  lead  to  relaxing  the  condition  that  <  is  a  partial 
order  to  <  being  reflexive  and  transitive.  Then  O(X)  is  the  family  of  all 
increasing  subsets  of  X  w.r.t.  this  relation. 


that  D(A)  G  Ka-  Thenra(</>)  =  D(A ),  hence  f(<p)  —  1. 
Since  Vc  =  HSP(A)  and  C  is  sound  and  complete  w.r.t. 
Vc  it  follows  that  h  c  <t> •  D 

Corollary  4  If  Vc  is  finitely  generated  by  the  family 
{ Ai , . . . ,  An}  of  finite  algebras  in  Vc  then  C  is  sound  and 
complete  w.r.t.  K  =  {D(Ai), . . . ,  D(An)}. 

A  similar  idea  occurs  in  the  work  of  Jonsson  and  Tarski 
[10]  on  Boolean  algebras  with  operators.  We  also  analyzed 
other  representation  theorems,  e.g.  for  orthomodular  lat¬ 
tices,  which  can  be  used  with  similar  results.  A  presentation 
of  these  results  is  beyond  the  scope  of  the  present  paper. 

4  Automated  theorem  proving 

Translation  to  clause  form.  Let  £  be  a  propositional  logic 
with  a  set  Var  of  propositional  variables,  and  satisfying  con¬ 
dition  (L),  and  let  Kc  be  as  in  Definition  2.  In  what  follows 
we  assume  that  C  is  sound  and  complete  w.r.t.  Kc . 

LemmaS  For  every  (X,  <,  {J?x}rtes)  G  Kc  and  every 
m  :  Var  -4  0(X);  for  every  H  G  Lh,K  G  La^R  G 
Jh,  Q  G  Mh ,  R '  G  Ja  and  Q '  G  Ma  the  following  hold: 


X 

K»,: 

v  P 

iff* 

x  G  m(p)  ifp  € 

;  Var 

X 

N  m,: 

E  01  V  <t>2 

iff 

x\=m, 

,x  4*1  or  X  |= 

4>  2 

X 

Nm,: 

c  <t>l  A  4>2 

iff 

X  Kn, 

,*  4>1  X  l=m, 

x  & 

X 

K 

'HW 

iff 

Xf- 

,HxM  4> 

X 

t  K{4>) 

iff 

X^m. 

,KX(x)  * 

X 

t=m,= 

c  >  •  • 
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Proof :  (Sketch)  Follows  from  the  definition  of  |=  and  from 
the  definitions  of  the  operations  on  O(X)  induced  by  oper¬ 
ations  or  relations  in  LA,  La,  JA,  MA,  Ja,  Ma,  as  given  in 
Theorem  1.  □ 

If  Kc  is  an  elementary  class  (i.e.  it  is  the  class  of  all 
ordered  £ -relational  structures  satisfying  a  set  E  of  first- 
order  formulae),  Lemma  5  can  be  used  in  order  to  define  a 
structure-preserving  transformation  method  to  clause  form. 
This  structure-preserving  translation  to  clause  form  is  in¬ 
spired  by  the  well-known  method  from  classical  logic  due 
to  Tseitin:  for  every  subformula  ip  of  <t>  a  new  (unary)  pred¬ 
icate  symbol  P^  is  introduced.  The  fact  that  this  renaming 
is  correct  is  expressed  by  a  set  (Ren)  of  formulae. 
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Proposition  6  Let  <j>  e  Fma(£).  Then  h c  <t>  iff  the  fol¬ 
lowing  conjunction  of  formulae  is  unsatisfiable  (where  the 
predicates  are  indexed  by  subformulae  j)  of  <j>): 

'  (Neg)  3x-P0(x) 

(Ren)  Va[P^lVv-2(®)  ++  Pj>i(x)  v  p^(x)] 

Vx[P1pl^1f,2(x)  -H-  P1j)l(x)  A  P1f,2(x)] 

Vx[PhW(x)  P*(H(X))] 

Vx[Pkw(x)  <•>  ~*+(K(x))] 


V*[PflWl>. 

,  xn,x)  A  A"=1 

3*1,  •  •  • 

Xn ,  •  •  * 

*♦«(■<))] 

v*[Pq(*i,. 

,xn,x)  ->■  vr= 

V*i,  . . . 

(•*))] 

Vx[Pr'Wi, 

■;1pn)(X)  ** 

3xi,  .  • . 

xn(R'(x  1,. . 

.,xn,x)  A  /\"=J 

“’P/’i  (z.))] 

Vs[Pq'Wi, 

^n)(x)  *+ 

Vxi,  . .. 

xn(Q'(x\,.. 

-»•  V” 

(*«))] 

Vx  x  <  x 

Vx,y,z[(x<yAy<z)-+x  <  z] 

Vx,  y[(x<yAP$  (x))  ->  P4  (y)] 

E  (the  set  of  first-order  formulae  that  characterize  Kc) 

Proof :  (Sketch)  It  follows  from  the  fact  that  for  every 
Kripke-style model  (X,  m),  where X  e  Kc  and m  :  Var  -> 
O(X)  such  that  X  <t>  we  can  construct  a  first-order 
model  of  (Neg)  U  (Ren)  U  (Str),  and  vice  versa.  □ 

The  set  of  formulae  (Neg)  U  (Ren)  U  (Str)  can  be  trans¬ 
formed  to  clause  form  in  the  usual  way.  Proposition  7  gives 
an  estimate  of  the  number  of  clauses  generated  from  a  given 
formula  <j>  by  using  Proposition  6. 

Proposition  7  Let  <j>  be  a  formula  of  C  containing  logical 
connectives  in  {V,  A}  U  Lh  U  La  U  Jh  U  Mh  U  Ja  U  Ma. 
Let  l  be  the  number  of  subformulae  of  <j>,  and  let  m  be  the 
highest  among  the  arities  of  the  operators  occurring  in  <j>. 
Then  the  number  of  clauses  generated  using  the  rules  in 
Proposition  6  is  at  most  l{m  +  2)  +  $,  where  s  is  the  number 
of  clauses  induced  by  the  formulae  in  (Str)  U  (Neg). 

Proof  :  It  follows  from  the  fact  that  the  formulae  in  (Ren) 
introduce,  for  every  operation  in  {V,  A},  3  clauses;  for  ev¬ 
ery  operation  in  Lh  U  La  2  clauses;  and  for  every  n-ary 
operation  in  Jh  U  Mh  U  Ja  U  Ma,  (n  +  2)  clauses.  □ 

Translation  to  clause  form  for  finitely-valued  logics.  If 
C  is  a  finitely- valued  logic  having  as  algebra  of  truth  values 
a  distributive  lattice  with  operators  the  procedure  described 
above  can  be  further  extended.  The  Priestley  dual  D(A)  of 
A  can  be  seen  as  a  finite  Kripke-style  frame  with  respect  to 
which,  by  Corollary  4,  C  is  sound  and  complete.  Even  a 
version  of  first-order  finitely- valued  logics  with  quantifiers 
V,  3  (interpreted  as  generalized  meets  resp.  joins  in  A ),  de¬ 
fined  by  following  the  ideas  in  [1],  is  tractable  this  way.  The 
rules  for  translating  to  clause  form  quantified  formulae  es¬ 
sentially  use  the  fact  that  every  element  in  the  lattice  of  truth 
values  can  be  uniquely  written  as  a  join  of  join-irreducible 
elements  [9,  21,  23];  this  argument  cannot  be  applied  if  the 
algebra  of  truth  values  is  infinite.  For  details  cf.  [21, 23]. 


For  finitely-valued  logics  the  use  of  quantification  over 
the  possible  worlds  can  be  avoided:  the  existential  and  uni¬ 
versal  quantifier  can  be  replaced  with  joins  resp.  meets  over 
all  the  elements  of  J 0(A),  which  is  a  finite  set.  The  ad¬ 
vantage  is  that  (for  propositional  logics)  in  the  clause  gen¬ 
eration  process,  we  remain  inside  propositional  logic.  The 
disadvantage  is  that,  in  the  presence  of  logical  connectives 
in  E  of  arity  greater  than  1 ,  it  may  lead  to  long  clause  forms. 

Theorem  proving.  The  translation  to  clause  form  described 
in  this  section  is  a  translation  to  classical  first-order  logic; 
thus  general  methods  for  automated  theorem  proving  in 
classical  logic  can  be  applied  to  the  resulting  sets  of  clauses. 

5  Examples 

We  now  illustrate  the  construction  of  classes  of  Kripke-style 
models  and  the  translation  to  clause  form  presented  above 
by  some  examples:  logics  based  on  (finite)  p-lattices,  log¬ 
ics  based  on  implicative  algebras,  logics  based  on  Ockham 
algebras,  relevant  logics.  Due  to  space  limitations,  only 
one  example  is  presented  in  detail;  for  the  others  the  main 
ideas  (especially  concerning  the  use  of  Priestley  duality  for 
constructing  classes  of  Kripke  models)  are  only  sketched. 
Other  examples,  such  as  SHn- logics  or  SHKn- logics,  are 
presented  in  detail  in  [21]. 

5.1  Logics  based  on  p-lattices 

A  distributive  p-lattice  is  an  algebra  (A,  V,  A,  *,  0, 1)  of  type 
(2, 2, 1, 0, 0)  such  that  (A,  V,  A,  0, 1)  is  a  distributive  lattice 
with  0  and  1,  and  *  is  an  operation  of  pseudocomplementa¬ 
tion,  i.e.  a *  =  max{6  €  A  |  a  A  b  =  0}. 

It  is  known  that  the  distributive  p-lattices  form  a  variety 
Bw,  and  that  the  subvarieties  of  Bw  form  a  chain  B_i  C 
Bo  C  ...  C  B„  C  ...  C  Bo;,  where  B„i  is  the  trivial 
variety,  Bo  the  variety  of  Boolean  algebras,  and  Bn  (n  >  1) 
is  generated  by  Pn  =  2n  ©  1  (the  n-th  power  of  the  2- 
element  chain  to  which  a  new  top  element  is  adjoined). 

Lemma  8  If  a  logic  C  is  sound  and  complete  w.r.t.  Bw,  it 
is  sound  and  complete  w.r.t.  the  class  of  all  Kripke  mod¬ 
els  ((X,  <),  m),  where  <  is  a  partial  order 3  on  X,  m  is  a 
meaning  function ,  and  the  satisfiability  relation  is  defined 
as  in  Lemma  5  for  <j>\  V  <t>2^<t>i  A  <j>2,  and,  additionally,  by: 

n,xr  iffVv(x<V^X£mty<l>). 

Proof :  Follows  from  the  duality  theorem  for  p-lattices  or 
Heyting  algebras  (cf.  [18])  and  Theorem  3,  and,  in  particu¬ 
lar,  from  the  fact  that  for  every  meaning  function  m  :  Var  -> 
O(X),  and  every  a:  €  X,  x  €  =  -> (fn(<f>))  =  {z  | 

Vy,  z  <  y  ->  y  &  fn(<f>)}  iff  Vy(a;  <  y  -»  y  &  rn(<j>)).  □ 

3One  can  in  fact  prove  soundness  and  completeness  w.r.t.  the  class  of 
the  Kripke  frames  ( X ,  <),  where  <C  X  x  X  is  reflexive  and  transitive. 
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Figure  1.  Vn  =  D{Pn) 

For  the  translation  to  clause  form,  the  semantics  of  the 
logical  connectives  V,  A  is  used,  as  explained  in  Section  4. 
Additionally,  for  formulae  starting  with  the  connective  *  the 
following  formula  has  to  be  added  to  (Ren): 

Va '\P^(x)  <4  (Vj t(x  <  y  -4  -^(s/)))]. 

Logics  based  on  finite  distributive  p-lattices.  Concern¬ 
ing  logics  having  Pn  as  an  algebra  of  truth  values,  we  make 
the  following  remarks  about  the  use  of  the  Priestley  repre¬ 
sentation:  the  Priestley  dual  of  Pn  =  2"  ©  1  is  the  space 
Vn,  represented  in  Figure  1.  Thus,  for  every  n,  P„  is  iso¬ 
morphic  to  the  lattice  of  order  filters  of  V„,  where  for  every 
U  c  0(Vn),  U*  =  X\  l  U.  The  finiteness  of  V„  leads 
to  the  following  simplified  rule  for  the  pseudocomplemen¬ 
tation  *,  without  quantification  over  the  elements  of  Vn : 

Pf  (-L)  <4  -iP^,(i.)  A  ->P«^(1)  A  ...  A  -'P^>{n) 

P^(i)  4^  ~~iP(f,(i')  (for  every  *<E  {l,...,n}). 

Proposition  9  The  number  of clauses  generated from  a  for¬ 
mula  <j>  in  a  logic  based  on  Bw  is  0(1),  where  l  is  the  length 
off.  In  a  logic  based  on  Pn,  the  number  of  clauses  gener¬ 
ated from  a  formula  <j>  is  0(nl)  and  all  clauses  are  ground. 

Proof  :  Since  the  formulae  for  renaming  introduce  at  most  3 
new  clauses,  in  the  case  of  logics  based  on  B u,  the  number 
of  clauses  generated  by  the  above  procedure  is  0(1).  In  the 
case  of  logics  based  on  P„,  due  to  the  particular  form  of 
Vn,  for  every  x  €  V'„\{±}  there  is  exactly  one  y  e  Vn  with 
y  >  x.  For  x  =-L  there  are  exactly  n  +  1  such  elements. 
Thus,  in  this  case  the  number  of  clauses  generated  from  a 
formula  <p  is  1  +3(fci  4  k2)  (n + 1) + fc3  ( (n + 2)  4  2n),  where 
ki,  k2,  k3  are  the  numbers  of  occurrences  of  V,  A,  resp.  *  in 
<j>.  it  is  easy  to  see  that  all  these  clauses  are  ground.  □ 

Proposition  1 0  Validity  of  propositional formulae  in  logics 
based  on  Bw  and  Pn  is  decidable. 

Proof.  If  we  analyze  the  form  of  the  clauses  in  (Str)  U 
(Neg)  U  (Ren)  for  a  formula  in  a  logic  based  on  Bw,  we 
notice  that  we  are  in  the  situation  studied  in  [4].  Hence,  the 
validity  of  formulae  in  logics  based  on  B^  is  decidable  by 
ordered  chaining  with  selection.  The  propositional  logics 
based  on  P„  for  some  n  are  of  course  decidable.  □ 

For  first-order  logics  based  on  P„  one  can  prove  the  decid¬ 
ability  of  certain  fragments  by  using  classical  decidability 
results  from  first-order  logic.  For  further  details  cf.  [22]. 


5.2  Other  classes  of  logics 

Intuitionistic  logics.  By  the  duality  theorem  for  Heyting 
algebras  (cf.  e.g.  [5])  and  Theorem  3  it  follows  that  intu¬ 
itionistic  logics  are  sound  and  complete  with  respect  to  the 
class  of  all  Kripke  models  (X,  <),  where  <  is  a  partial  order 
on  X,  a  well-known  result. 

Logics  based  on  Ockham  algebras.  Ockham  algebras  are 
distributive  lattices  with  0, 1 ,  and  one  unary  lattice  antimor¬ 
phism  ~.  They  are  models  for  logics  endowed  with  a  nega¬ 
tion  operator  that  satisfies  the  De  Morgan  laws.  A  Priestley 
duality  for  Ockham  algebras  is  given  in  [25].  The  corre¬ 
sponding  Priestley  spaces  are  endowed  with  a  unary,  contin¬ 
uous,  order-reversing  map  g .  By  Theorem  1  and  Theorem  3, 
the  logics  which  are  sound  and  complete  with  respect  to 
the  variety  of  Ockham  algebras  have  as  Kripke-style  models 
ordered  spaces  (X,  <,g)  endowed  with  an  order-reversing 
map  (the  condition  that  <  is  a  partial  order  can  be  relaxed 
to  the  condition  that  <  is  reflexive  and  transitive). 

Lukasiewicz  logics  of  order  n.  We  showed  that,  by  us¬ 
ing  the  Priestley-style  duality  for  implicative  algebras  due 
to  Martinez  [12],  in  the  case  of  Lukasiewicz  algebras  of  or¬ 
der  n  we  obtain  a  translation  to  clause  form  similar  to  the 
translation  to  mixed  integer  programming  given  in  [6]. 
Relevance  logics.  Urquhart  [26]  gave  an  algebraic  seman¬ 
tics  for  relevance  logics  in  terms  of  relevant  algebras  (a 
class  of  distributive  lattices  with  operators)  and  established 
a  Priestley  duality  for  this  type  of  algebras.  We  showed  that 
for  every  space  X  obeying  all  those  properties  of  the  Priest¬ 
ley  duals  of  relevant  algebras  which  do  not  involve  topolog¬ 
ical  notions,  O(X)  (with  operations  appropriately  defined) 
is  a  relevant  algebra.  Thus,  by  Theorem  3,  such  spaces 
provide  a  family  of  Kripke-style  models  for  relevant  log¬ 
ics,  which  coincides  with  the  Kripke  models  for  a  version 
of  relevance  logics  studied  by  Routley  and  Meyer  (cf.  [19]). 
Theorem  3  shows  that  soundness  and  completeness  with  re¬ 
spect  to  this  class  of  Kripke-style  models  can  be  proved  by 
completely  algebraical  means,  by  using  the  soundness  and 
completeness  with  respect  to  the  class  of  relevant  algebras. 

6  Related  Work 

The  work  reported  here  was  inspired  by  the  papers  of  Baaz 
and  Fermtiller  [1]  and  Hahnle  [7]  on  resolution-based  theo¬ 
rem  proving  in  finite-valued  quantificational  first-order  log¬ 
ics.  It  extends  the  ideas  in  [21]  where  we  gave  a  method 
for  translation  to  a  signed  clause  form  and  signed  resolu¬ 
tion  methods  for  (first-order)  finitely- valued  logics  having 
a  finite  DLO  A  as  algebra  of  truth  values.  The  signed  lit¬ 
erals  used  in  [21,  23]  were  of  the  form  |  a  \L*  and  j  ot  \lf, 
where  a  €  D(A).  Our  method  extended  existing  methods 
for  automated  theorem  proving  in  regular  logics  [7]  (there, 
the  literals  were  of  the  form  >  i  \l  and  resp.  [  <  ~i  \L,  for 
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some  truth  value  i)  to  cases  when  the  set  of  truth  values  is 
not  linearly  ordered.  In  this  paper  we  replaced  the  nota¬ 
tion  above  by  L(a)  resp.  L(a)  (the  translation  is  one  to 
classical  logic  [22]),  and  showed  that,  using  additional  the¬ 
oretical  considerations,  the  ideas  we  had  for  finitely- valued 
logics  can  be  extended  to  more  general  propositional  logics. 

Among  existing  methods  exploiting  the  structure  of  the 
algebra  of  truth  values,  we  mention  [16,  13,  15],  who  gave 
theorem  proving  systems  for  m-valued  Post  logics  and  for 
algorithmic  logics,  Salzer  [20]  who  studied  operators  and 
distribution  quantifiers  in  finitely-valued  logics  based  on 
semi-lattices,  and  Hahnle  [9]  who  derived  tableau-style  ax- 
iomatizations  of  distribution  quantifiers  by  using  BirkhofFs 
representation  theorem  for  finite  distributive  lattices.  Meth¬ 
ods  for  automated  proving  in  paraconsistent  or  annotated 
logics  (cf.  e.g.  [11])  also  have  similarities  with  our  approach 
in  the  case  of  finitely- valued  logics.  Known  methods  from 
modal  logic  seem  to  occur  as  particular  cases  of  general 
concepts  such  as  those  considered  here  (see  also  [5],  [14]). 

7  Conclusions 

In  this  paper  we  showed  that  the  Priestley  duality  for  dis¬ 
tributive  lattices  with  operators  provides  a  theoretical  tool 
for  clarifying  the  link  between  algebraic  and  Kripke-style 
models  for  certain  classes  of  non-classical  logics.  Based 
on  these  results,  we  gave  a  method  for  translation  to  clause 
form  and  automated  theorem  proving  for  some  of  these 
classes  of  non-classical  logics.  We  expect  that  this  will  en¬ 
able  us  to  use  recent  results  in  the  algebraic  investigation 
of  various  classes  of  distributive  lattices  with  operators  for 
obtaining  Kripke  models  and  automated  theorem  proving 
procedures  for  logics  with  interesting  practical  applications. 
Until  now  we  only  studied  the  existence  of  decision  proce¬ 
dures  in  certain  cases;  in  the  future  we  intend  to  continue 
our  work  in  this  direction. 
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Abstract 

In  the  last  years  two  automated  reasoning  techniques  for 
clause  normal  form  arose  in  which  the  use  of  labels  are 
prominently  featured:  signed  logic  and  annotated  logic  pro¬ 
gramming,  which  can  be  embedded  into  the  first.  The  un¬ 
derlying  basic  idea  is  to  generalise  the  classical  notion  of  a 
literal  by  adorning  an  atomic  formula  with  a  sign  or  label 
which  in  general  consists  of  a  possibly  ordered  set  of  truth 
values .  In  this  paper  we  relate  signed  logic  and  classical 
logic  more  closely  than  before  by  defining  two  new  transfor¬ 
mations  between  them.  As  a  byproduct  we  obtain  a  number 
of  new  complexity  results  and  proof  procedures  for  signed 
logics. 


1  Introduction 

In  the  last  years  two  automated  reasoning  techniques  for 
clause  normal  form  arose  in  which  the  use  of  labels  are 
prominently  featured:  from  generic  treatments  of  many¬ 
valued  logic,  so-called  signed  logic  emerged  (see,  for  exam¬ 
ple,  [6,  7,  3,  4,  14,  15])  while  annotated  logic  programming 
(see,  for  example,  [12,  8,  9])  was  motivated  by  attempts  to 
deal  with  inconsistency  in  deductive  databases.  Both  ap¬ 
proaches  are  closely  connected  to  each  other  [13,  10]  and  to 
constraint  logic  programming  [11].  In  fact,  annotated  logic 
can  be  embedded  into  signed  logic  [13]. 

In  each  case  the  underlying  basic  idea  is  to  generalise  the 
classical  notion  of  a  literal  by  adorning  an  atomic  formula 
with  a  sign  or  label,  which  in  general  consists  of  a  finite 
set  of  (truth)  values.  Whenever  the  values  appearing  in  the 
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signs  are  partially  ordered,  polarities  can  be  assigned  to  sig¬ 
ned  literals  in  a  natural  way  which  gives  rise  to  generalised 
notions  of  a  Horn  set.  It  turns  out  that  many  problems  can 
be  represented  more  succinctly  using  formulae  over  signed 
literals  whose  proof  procedures  and  complexities  are  often 
(but  not  always)  similar  as  in  classical  logic. 

In  the  present  paper  we  relate  signed  logic  and  classi¬ 
cal  logic  more  closely  than  it  has  been  done  before.  This 
is  done  by  defining  two  new  transformations  between  them. 
After  formal  definition  of  some  basic  notions  in  the  next 
section  we  start  in  Section  3  with  transforming  arbitrary 
classical  formulae  in  conjunctive  normal  form  (CNF)  into 
signed  CNF  formulae  with  at  most  two  literals  per  clause. 
This  provides  an  alternative  proof  of  NP-hardness  of  signed 
2-SAT  (first  proved  by  [14])  and  creates  the  possibility  to 
compare  classical  and  signed  deduction  procedures  experi¬ 
mentally.  In  Section  4.1  we  take  the  reverse  direction  and 
reduce  signed  Horn  formulae  based  on  certain  partial  orders 
to  classical  logic.  In  the  case  of  lattice  orders  this  yields  the 
new  result  that  generalised  Horn  problems  turn  out  to  have 
still  polynomial  complexity  with  respect  to  formula  size  and 
number  of  truth  values  (Section  4.2).  We  can  also  extract  an 
efficient  decision  procedure  based  on  generalised  unit  res¬ 
olution  (Section  4.3).  A  major  advantage  of  our  reduction 
to  classical  logic  is  that  it  scales  up:  we  demonstrate  this  by 
sketching  generalisations  to  infinite  orders  in  Section  4.4 
and  to  partial  orders  that  are  not  lattices  in  Section  4.5. 

Due  to  space  limitations,  most  proofs  had  to  be  omitted. 
A  full  version  of  this  paper  is  available  from  the  authors  on 
request. 

2  Basic  Definitions 
2.1  Syntax  of  Signed  Logic 

Definition  1.  A  truth  value  set  is  a  non-empty,  finite  set 
N  =  {ii,  «2, .  •  • ,  in}-  The  cardinality  of  N  is  denoted  \N\. 
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Definition  2.  Let  E  be  a  propositional  signature,  i.e.,  a  de¬ 
numerable  set  of  propositional  variables.  We  define  the  set 
of  atomic  signed  formulae  (or  signed  atoms  for  short)  as: 

{S:Pi  \SCN,Pi  EE}  . 

Definition  3.  Given  a  signed  atom  S  :  p,  then  5  is  said  to 
be  its  sign.  Let  >  be  a  partial  order  on  the  truth  value  set  N, 
let  denote  the  set  {j  E  N  \  j  >  i)  and  let  \.i  denote  the 
set  {j  6  N  |  j  <  i }.  If  a  sign  S  is  equal  to  either  f  i  or  4J, 
for  some  i  E  N,  then  it  is  called  a  regular  sign.  A  signed 
atom  with  a  regular  sign  is  called  a  regular  atom. 

Definition  4.  A  signed  clause  C  is  an  expression  of  the 
form 

Si  :pi,...,Sk  ■■ Pk  ->  s[  :qi,...,S'{:qi  , 

where  Si  :pi, . . .  ,Sk  :pk  and  SJ  :qi,...,S't  :  qt  are  sig- 
ned  atoms  and  k,  l  >  0. 

We  say  the  signed  atoms  Si  :  p\ , . . . ,  Sk  :  Pk  occur  with 
negative  polarity  in  C,  the  signed  atoms  S[  :  q\ , . . . ,  S[ :  qi 
occur  with  positive  polarity.  The  expression  on  the  left  of 
-¥  is  called  the  body  of  the  clause  and  the  expression  on  the 
right  is  called  the  head.  A  signed  formula  in  conjunctive 
normal  form  (CNF)  is  a  finite  set  of  signed  clauses. 

A  signed  clause  is  called  regular  if  (N,  >)  is  a  lattice  and 
it  only  contains  regular  atoms  with  signs  of  the  form  t  i.1  A 
signed  CNF  formula  is  called  regular  if  it  only  contains  reg¬ 
ular  clauses.  A  regular  clause  containing  at  most  one  atom 
with  positive  polarity  is  a  regular  Horn  clause.  A  regular 
CNF  formula  consisting  solely  of  regular  Horn  clauses  is  a 
regular  Horn  formula. 

In  signed  clauses  k  =  0  and  Z  =  0  are  allowed;  thus,  for 
signed  atoms  p ,  q,  both  p,  q  ()  and  ()  ->  p,q  are  signed 
clauses,  and  we  represent  them  by  p,  q  — ►  and  -4  p ,  q. 
When  k  =  0  and  1  =  0  we  have  the  signed  empty  clause, 
denoted  by  □. 

Definition  5.  The  length  of  a  signed  atom  S  :  p,  denoted  by 
\S  :p\,  is  |S|+1,  where  \S\  denotes  the  cardinality  of  S.  The 
length  of  a  signed  clause  C,  denoted  by  \C\,  is  the  sum  of 
the  lengths  of  the  signed  atoms  occurring  in  C .  The  length 
of  a  signed  CNF  formula  I\  denoted  by  |T|,  is  the  sum  of 
the  lengths  of  the  clauses  in  T. 

2,2  Semantics  of  Signed  Logic 

Definition  6.  An  interpretation  is  a  mapping  that  assigns 
to  every  propositional  variable  of  the  signature  E  a  truth 
value  of  N.  An  interpretation  I  satisfies  a  signed  atom 

Regular  clauses  could  also  be  defined  containing  only  signs  of  the 
form  4.  i  instead  of  signs  of  the  form  t  i-  The  results  of  this  paper  are  also 
valid  for  regular  clauses  defined  that  way. 


S:p,  in  symbols  I  |=  5  :  p,  iff  I(p)  €  S;  I  satisfies  a  signed 
clauseC  =  Si  :pi,...,S*  :pk  -4  S[  :  qi,  de¬ 

noted  1 1=  C,  iff  the  following  holds:  If  I  satisfies  all  of 
Si  :  pi , . . . ,  Sfc  :  pk  then  I  must  also  satisfy  at  least  one  of 
S[  :  qi , . . . ,  Sj  :  qi.  A  signed  CNF  formula  T  is  satisfiable 
iff  there  exists  an  interpretation  I  that  satisfies  all  the  sig¬ 
ned  clauses  in  T.  We  say  then  that  /  is  a  model  of  F  and 
we  write  /  |=  F.  A  signed  CNF  formula  that  is  not  satis¬ 
fiable  is  unsatisfiable.  The  signed  empty  clause  is  always 
unsatisfiable  and  the  signed  empty  CNF  formula  is  always 
satisfiable. 

In  particular,  I  satisfies  — >  £(  :  qi , . . . ,  5/ :  qi  iff  it  sat¬ 
isfies  at  least  one  of  S[  :  qi, . . . ,  Sj :  qr,  similarly,  I  satisfies 
Si  :  pi , . . . ,  Sk  :  Pk  iff  it  does  not  satisfy  at  least  one  of 
Si:pi,...,Sk  :pk- 

Observe  that  if  we  take  N  =  {true,  false},  assuming 
true  >  false,  and  consider  only  regular  atoms  of  the  form 
f  true  :  p,  then  we  obtain  the  logic  of  classical  conjunc¬ 
tive  normal  form:  t  true  :  P  is  equivalent  to  the  classical 
atom  p  if  it  occurs  with  positive  polarity,  and  to  the  negated 
classical  atom  -7 p  if  it  occurs  with  negative  polarity.  So, 
the  classical  clause  pi,...,Pk  ->  Qi > •  •  • , Qi  is  equivalent 
to  the  regular  clause 

t  true  :  pi , . . . ,  t  & rue  :  Pk 

t  true  :  ft, . . . ,  t  true  :  qi  . 

In  the  following,  when  we  refer  to  classical  clauses  we  use 
the  former  notation. 

In  classical  propositional  logic,  clauses  are  sometimes 
defined  as  a  finite  disjunction  of  literals  (i.e.,  signed  atoms 
or  negated  signed  atoms).  It  is  easy  to  see  from  the  pre¬ 
vious  definitions  that  pi, . . .  ,pk  qi, . . . ,  qi  is  logically 
equivalent  to  -ipi  V  •  ■  •  V  ->pk  V  qi  V  *  •  *  V  qi .  So,  classi¬ 
cal  atoms  occurring  with  negative  polarity  are  implicitly 
negated.  In  our  definition  of  signed  clauses,  signed  atoms 
occurring  with  negative  polarity  are  implicitly  negated  as 
well  in  the  sense  that  a  signed  atom  S  :  p  with  negative  po¬ 
larity  is  satisfied  by  an  interpretation  I  iff  I  S  :  p.  Nev¬ 
ertheless,  we  do  not  define  regular  clauses  as  a  disjunction 
of  regular  atoms  with  arbitrary  regular  signs  since,  as  we 
assume  N  to  be  partially  ordered,  an  occurrence  of  t  z :  p 
with  negative  polarity  is  not,  in  general,  logically  equivalent 
to  4  j  :p  for  any  j  E  N  and  can  therefore  not  be  represented 
by  a  positive  occurrence  of  some  literal  4  j  :p. 

2,3  Satisfiability  Problems 

The  propositional  satisfiability  problem,  briefly  called 
SAT,  is  the  problem  to  determine  whether  a  classical  CNF 
formula  is  satisfiable,  and  is  known  for  being  the  original 
NP-complete  problem  [1].  However,  there  exist  linear- time 
algorithms  for  solving  the  SAT  problem  when  we  consider 
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Horn  formulae  (Horn  SAT)  [2]  or  CNF  formulae  with  only 
two  literals  per  clause  (2-SAT)  [5].  When  a  CNF  formula 
admits  three  literals  per  clause  (3-SAT),  it  is  again  an  NP- 
complete  problem. 

In  the  last  years,  some  results  about  the  complexity  of  the 
satisfiability  problems  for  different  versions  of  signed  CNF 
formulae  have  been  published.  These  problems  have  the 
truth  value  set  N  (resp.  ( N ,  >))  as  a  second  input  parameter 
(besides  the  formula  T  to  be  tested  for  satisfiability).  Thus, 
signed  SAT  is  the  problem  of  deciding  for  an  arbitrary  for¬ 
mula  T  over  an  arbitrary  thruth  value  set  N ,  whether  there 
is  an  interpretation  over  N  satisfying  T.  We  also  consider 
decision  problems  where  N  is  not  an  input  parameter  but 
fixed,  which  we  denote  by  attaching  the  fixed  truth  value 
set  N  as  an  index  to  the  name  of  the  decision  problem.  For 
example,  given  a  fixed  truth  value  set  A7",  signed  SATjy  is 
the  problem  of  deciding  for  an  arbitrary  formula  T  over  N 
whether  there  is  an  interpretation  over  N  satisfying  T. 

The  classical  SAT  problem  is  trivially  reducible  to  sig¬ 
ned  SKY  {true,  false}*  therefore,  the  latter  and  the  more  gen¬ 
eral  problem  signed  SAT  are  NP-complete.  Signed  2-SATat 
is  known  to  be  NP-complete  for  \N\  >  3  [14]  (thus,  the 
general  problem  signed  2-SAT  is  NP-complete).  But,  the 
restriction  of  signed  2-SAT  to  the  case  where  all  signs  are 
singletons,  called  monosigned  2-SAT,  is  polynomially  solv¬ 
able  [14].  Concerning  regular  CNF  formulae,  it  is  known 
that  regular  Horn  SAT  [7,  3]  and  regular  2-SAT  [14]  are 
both  polynomially  solvable  in  case  the  partial  order  defined 
over  the  set  of  truth  values  is  total. 


and  H-,  and  for  each  clause  Cm  in  T  there  is  a  propositional 
variable  p'm  in  S'.  The  idea  is  that  p'm  has  the  truth  value  pt 
or  pj  in  a  (non-classical)  interpretation  /'  if  the  classical 
atom  pi  is  the  one  that  makes  the  clause  Cm  true  in  the  cor¬ 
responding  classical  interpretation  I.  That  is,  =  ft 

if  pi  is  false  in  I  and  occurs  with  negative  polarity  in  Cm, 
and  If(pfm )  =  Pi  if  Pi  is  true  in  I  and  occurs  with  positive 
polarity  in  Cm-  An  atom  can  only  have  a  single  truth  value 
whereas  a  clause  Cm  can  be  “made  true”  by  more  than  one 
of  its  literals,  in  which  case  an  arbitrary  one  may  be  chosen 
to  be  the  truth  value  of  pfm.  For  each  clause 

Cm  ”  ftm,l  1  *  *  *  1  Pirn, km  Pjm,\  ?  ’  ‘  *  >  PjmJrn 

in  T  there  is  a  unit  clause 


C'm  = 


{Ptm,  1  ’  *  •  *  5  5  P~jm ,  1  ’  *  *  *  1  P™ 


in  T'.  The  signed  atom  in  C. represents  the  fact  that  Cm 
(the  m-th  clause  of  T)  is  made  true.  Thus  T'  represents  only 
satisfying  truth  assignments  of  T. 

This  is,  of  course,  not  enough.  We  must  ensure  that  V 
in  fact  represents  solely  such  truth  assignments  for  atoms 
in  T  which  are  consistent  or,  in  usual  terminology,  which  are 
well-defined  interpretations.  For  this  purpose,  Tl  contains 
for  all  (classical)  clauses  Cm  and  Cn  in  T.  resp.,  for  all 
propositional  variables^  andp^  in  E'  (1  <  m,  n  <  r)  and 
for  all  atoms,  resp.,  truth  values  pi  (1  <  i  <  s )  additional 
clauses 


D'mni  =  ( Pi  }  ’Pm  (Sn\  { Pi  })  :  Pn 


3  Transforming  Classical  SAT  into 
Signed  2-SAT 

3.1  The  Transformation 

In  this  section,  a  mapping  '  is  defined  that  transforms 
classical  CNF  formulae  into  signed  (not  necessarily  regu¬ 
lar)  2-CNF  formulae,  in  other  words,  a  reduction  of  clas¬ 
sical  SAT  to  signed  2-SAT;  the  transformation  is  shown  to 
be  computable  in  polynomial  time.  We  define  9  as  follows: 
Let  T  be  a  classical  CNF  formula  with  clauses  C\ , . . . ,  Cr 
(r  >  1)  over  a  signature  E.  Assume  that  pi, . . .  ,p$  (s  >  1) 
are  the  propositional  variables  occurring  in  T;  thus,  the  clau¬ 
ses  in  T  are  of  the  form2 

=  Pim  ,  •  •  •  ,Pim  ykm  ft’m.lJ  *  *  •  iPjmJm  ’ 

We  associate  with  T  a  signed  2-CNF  formula  Tf  over  the 
truth  value  set  N £  =  {ft  ,  •  •  •  ,P7,Pi ,  *  •  •  ,Pt)  anc^  signa_ 
ture  E'  =  {p[, . . .  ,p'r},  i.e.,  the  truth  values  are  the  clas¬ 
sical  atoms  annotated  with  the  two  possible  polarities  — 

2RecalI  from  Section  2  that  the  atoms  in  T  really  are  signed  atoms  of  the 
form  t  true  :  p,  but  the  signs  are  not  shown  in  representations  of  classical 
CNF  formulae. 


where 

S'n  =  K„,X  >•••  ’P»7„,kn  ’Pi,!  ’■■■  ’Pl,ln  >  • 

The  signed  clauses  express  that  if  an  atom  is  used 

with  positive  polarity  to  “make  true”  some  clause  Cm  of  T , 
then  it  cannot  be  used  with  negative  polarity  to  “make  true” 
any  other  clause  of  T. 

The  clause  D may  be  omitted  from  T'  if  ft  does  not 
occur  with  positive  polarity  in  Cm  or  does  not  occur  with 
negative  polarity  in  Cn.  Instead  of  the  clauses  D'mni,  the 
clauses 

Ci  =  (Pi }  :  P’m  ->  (Sn\  { Pi  })  :  Pn 

can  be  used.  The  proof  of  Theorem  7  shows  that  it  is  indeed 
sufficient  to  either  use  only  the  clauses  D’mni  or  only  the 
clauses  E’mni. 

Example  1.  Consider  the  classical  CNF  formula  V  consist¬ 
ing  of  the  clauses 


(Cl) 

P  ->  Q 

(C2) 

q  -*•  p 

(Cs) 

p,q 
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The  only  model  I  of  T  is  defined  by  I(p)  —  I (q)  =  true. 
The  result  of  applying  the  mapping  'to  T  is  a  signed  2- 
CNF  formula  T*  over  the  signature  E7  =  {PiiP^Pz}  and 
using  the  truth  value  set  N £  =  {p~ ,  q~ ,  p+ ,  g+  };  T7  consists 
of  the  clauses 


(C'O 

<?+} 

:  p'i 

(C'2) 

{?”) 

p+} 

■p'2 

(C'z) 

->  {p+, 

9+} 

■p's 

C^21l) 

{p+} 

p'2 

-4 

U+} 

p'i 

(^31l) 

{p+} 

Ps 

-4 

ti+} 

p'i 

(^122  ) 

{9+} 

p'l 

-4 

{p+} 

p'2 

(-P322) 

{g+} 

p'z 

-4 

{p+} 

p'2 

In  (non-classical)  interpretations  J7  satisfying  T7,  the  truth 
value  of  p[  is  g+,  and  the  truth  value  of  p2  is  p+.  The  truth 
value  of  P3  can  be  either  p+  or  q+ ,  according  to  the  fact  that 
both  atoms  in  the  clause  C3  are  satisfied  by  the  classical 
interpretation  I. 

3.2  Results 

The  following  theorem  states  the  correctness  of  the  trans¬ 
formation  7: 

Theorem  7.  A  classical  CNF  formula  T  is  satisfiable  if  and 
only  if  there  is  an  interpretation  over  satisfying  T7. 

Proof  sketch.  Only-if-part :  Assume  that  the  classical  inter¬ 
pretation  I  satisfies  T.  Define  the  interpretation  V  over 
as  follows:  In  each  clause  Cm  G  T  there  has  to  be  an  atom  p 
such  that  (1)  I(p)  =  true  and  p  occurs  positively  in  Cm  or 
(2)  I(p)  =  false  and  p  occurs  negatively  in  Cm ,  because 
otherwise  Cm  were  not  satisfied  by  I.  If  there  is  more  than 
one  such  atom  p  in  Cm,  then  choose  an  arbitrary  one.  If  (1) 
holds  forp,  then  define  I'(pfm)  =  p+ ,  otherwise  (i.e.,  if  (2) 
holds  for  p)  define  7'(p^)  =  p".  It  is  easy  to  show  that  J7 
satisfies  the  clauses  Clm  and  for  all  1  <  m,  n  <  r  and 
1  <i<$. 

If  part:  Assume  that  the  interpretation  V  satisfies  T7.  We 
define  the  classical  interpretation  I  for  all  atoms  p  G  E  as 
follows:  If,  for  any  1  <  m  <  r,  there  is  an  atom  p^  such 
that  I'(pfm)  =  p+,  then  let  I(p)  =  true ;  let  J(p)  =  false 
otherwise.  It  is  easy  to  show  that  I  satisfies  all  clauses  Cm 
inT.  □ 

The  size  of  T7  is  easily  seen  to  be 

y ^  (fem  +  lm  +  1)  +  (fn  /n  +  2)  , 
m  '  H c[\  "  m,n,iV  =\Dfmni\ 


0(\T\3).  Obviously,  T7  can  be  constructed  in  time  which  is 
linear  in  its  own  size  and,  thus,  the  time  complexity  of  its 
construction  is  in  0(|r|3). 

Theorem  8.  The  transformation  7  is  computable  in  cubic 
time. 

In  [14],  NP-hardness  of  signed  2-SAT  was  proven  with 
a  poly-time  reduction  from  3-colourability  of  graphs  to  sig¬ 
ned  2-SATtv  with  |JV|  =  3.  As  classical  SAT  is  NP-com- 
plete,  NP-hardness  of  signed  2-SAT  (with  N  as  an  input 
parameter)  follows  as  well  as  a  corollary  from  Theorem  8. 

Corollary  9.  Signed  2-SAT  is  NP -complete. 

An  additional  benefit  of  the  transformation  7  is  that  it 
makes  it  possible  to  compare  classical  decision  procedures 
with  specific  procedures  for  signed  CNF. 

4  Transforming  Regular  Horn  SAT  into 
Classical  Horn  SAT 

4.1  The  Transformation 

In  this  section,  we  define  a  mapping  *  that  transforms 
lattice- ordered  regular  Horn  formulae  into  classical  Horn 
formulae;  and  we  prove  that  it  is  linear  in  the  size  of  the 
transformed  formula  and  quadratic  in  the  size  of  the  truth- 
value  lattice. 

We  assume  in  the  following  that  the  formula  to  be  trans¬ 
formed  does  not  contain  a  signed  atom  of  the  form  f  J- :  p, 
where  ±  is  the  bottom  element  of  the  truth  value  lattice. 
This  is  not  a  real  restriction,  as  such  atoms  are  true  in  all 
interpretations;  they  can  be  removed  from  a  formula  in  lin¬ 
ear  time  preserving  satisfiability  as  follows:  (1)  if  a  clause 
contains  a  negative  occurrence  of  f  -L  :  p,  then  remove  that 
occurrence  from  the  clause;  (2)  if  a  clause  contains  a  pos¬ 
itive  occurrence  of  f  _L  :  p,  then  remove  the  whole  clause 
from  the  formula. 

The  mapping  *  is  defined  as  follows:  Let  T  be  a  regular 
Horn  formula  over  the  truth- value  lattice  ( N ,  >)  not  con¬ 
taining  the  sign  f  -L.  Let  C\ , . . . ,  Cr  be  the  clauses  in  F 
(r  >  1),  let  pi, . . .  ,ps  G  E  be  the  propositional  variables 
occurring  in  T  (s  >  1). 

We  associate  with  F  a  classical  Horn  formula  T*  over  the 
signature 

E*  =  {t*  :p  |  i  €  N,p£  E}  , 

that  is  signed  atoms — including  their  signs — are  used  as 
propositional  variables.  A  transformation  based  on  the  same 
principle  is  described  in  [16];  it  allows  to  transform  formu¬ 
lae  from  certain  finite- valued  logics  whose  truth  value  lat¬ 
tice  is  distributive  into  classical  CNF  formulae. 


which  is  less  than  or  equal  to  |r|  +  r  +  2 r2s  where  r  is 
the  number  of  clauses  in  T,  and  s  is  the  number  of  differ¬ 
ent  atoms  occurring  in  T.  As  r,  s  <  |F|,  this  places  |T7|  in 
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For  each  1  <  m  <  r  the  classical  Horn  formula  T*  con¬ 
tains  the  clauses  from  T,  which  in  T*  are  re¬ 

garded  as  classical  clauses  over  £*.  In  addition,  for  all  truth 
values  i,j  £  N  and  all  propositional  variables  p*  occurring 
in  T  (1  <  A;  <  s),  F*  contains 

1 .  the  clause 

D*jk  =  t  i'-Pk  t  j-Pk  , 

provided  that  (a)  i  >  j  and  (b)  there  is  no  j'  £  N  such 
that  i>  j'  >  j, 

2.  the  clause 

E*jk  =  t* -PkAj -Pk  -+  t (*Uj):pfc  , 

if  neither  i  >  j  nor  j  >  i,  where  i  U  j  is  the  supremum 
of  i  and  j  in  the  truth  value  lattice. 

As  r*  contains  the  clauses  from  I\  the  classical  inter¬ 
pretations  satisfying  F*  satisfy  F  as  well.  The  additional 
clauses  D*jk  and  E*jk  ensure  that  such  a  classical  interpre¬ 
tation  I*  over  the  signature  £*  corresponds  to  a  well  defined 
interpretation  I  over  the  signature  £. 

The  clauses  D*jk  represent  the  fact  that,  if  I  satisfies 
t i :  Pk>  i.e.,  I(pk)  >  i  and  i  >  j ,  then  I(pk)  >  j  and  I  sat¬ 
isfies  fj  :  Pk  as  well. 

The  clauses  E*-k ,  on  the  other  hand,  represent  the  fact 
that,  if  I  satisfies  both  t  i  'Pk  and  f  i  ’Pk>  i.e.,  I(pk)  >  *  and 
I(Pk)  >  j ,  then  I(jpk)  >i\Jj  and,  thus,  I  {iU  j)  :pk. 

The  precondition  (a)  i  >  j  for  the  inclusion  of  the  clau¬ 
ses  D*jk  in  T*  is  necessary  for  the  correctness  of  the  trans¬ 
formation;  in  case  not  i  >  j ,  the  clauses  D*jk  are  (in  gen¬ 
eral)  not  satisfied  by  arbitrary  interpretations.  Contrary  to 
that,  the  precondition  (b)  for  the  inclusion  of  the  D^k  and 
the  precondition  for  the  inclusion  of  the  clauses  E*jk  is  only 
needed  to  avoid  redundancies. 

The  following  lemma  shows  that  clauses  D*jk  for  val¬ 
ues  of  i,j  violating  precondition  (b)  are  redundant.  They 
are  true  in  all  interpretations  satisfying  T*.  Therefore,  their 
inclusion  would  not  impose  any  further  restriction  on  the 
models  off*. 


The  clauses  E^-k  are  tautological  (and  redundant),  when¬ 
ever  i  >  j  or  j  >  i;  in  particular,  they  are  not  needed  if  the 
ordering  >  on  the  truth  value  set  N  is  total. 

According  to  the  following  lemma,  it  is  not  necessary  to 
include  in  T*  clauses  of  the  form 

t*i  :p,...,tig -P  -*  T | _ |{*i j  •••!*«}  :P 

for  q  >  2. 

Lemma  11.  Let  F  be  a  regular  Horn  formula  over  a  signa¬ 
ture  £,  let  I*  be  a  classical  interpretation  satisfying  F*,  let 
p  £  £,  and  let  M  C  N  be  a  non-empty  set  of  truth  values 
such  that,  for  all  j  £  M , 

/+(ti  :  P )  =  true  ; 

then 

/*(t|jM:p)  =  true  . 

Example  2.  Assume  that  N  =  {X,  T,  true,  false}  and  the 
partial  order  over  N  is  the  lattice  shown  below: 

T 

/  \ 

true  false 

\  / 

X 

Given  a  regular  Horn  formula  T  (over  signature  £),  for  each 
propositional  variable  j?  occurring  in  F  we  add  the  following 
classical  Horn  clauses  (over  the  signature  £*)  to  obtain  T* : 

(D*)  fT:p  ->  \true:p 

\d{)  fT  :p  ->  j  false  :p 

(D%)  t  true  :p  ->  |-L  '-P 

\d{)  f false  :p  ->  f±:p 

(E*)  f  true  :  p ,  f  false  :  p  — >  t  T  :  p 

The  following  theorem  states  the  correctness  of  the  trans¬ 
formation  * : 

Theorem  12.  A  regular  Horn  formula  T  is  satisftable  over 
some  truth-value  lattice  ( N ,  >)  if  and  only  if  T*  is  satisft¬ 
able. 


Lemma  10.  Let  T  be  a  regular  Horn  formula  over  a  signa -  4.2  Results 

ture  £,  let  I*  be  a  classical  interpretation  satisfying  F*,  let 

p€S,  and  let  j,  j'  be  truth  values  in  N  such  that  The  size  of  r»  is  easiiy  seen  to  have 


3  >  f 


|r|  +  3s|AT|2 


and 


I*(tj  ■ P )  =  true  ; 


then 


/*( ty  • P )  =  true  . 


as  an  upper  bound  where  s  is  the  number  of  different  atoms 
occurring  in  T. 

As  s  <  |T|,  this  places  |F*|  in  0(|r||A'|2);  and,  since  the 
time  complexity  of  constructing  F*  is  linear  in  its  size,  the 
reduction  *  is  in  0(|r||iV|2). 
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If  the  ordering  >  on  the  truth  value  set  is  total,  the  size 
of  T*  is  bounded  by 

|r|  + 2s|;v|  , 

as  in  that  case  there  are  only  |A|  many  clauses  Dijk  for 
each  A;  in  P\  and  no  clauses  Eijk  are  needed.  Then,  the 
transformation  *  is  in  C>(|r| |iV|). 

Theorem  13.  The  transformation  *  is  computable  in  time 
linear  in  the  size  of  the  transformed  formula  and  quadratic 
in  the  size  of  the  truth  value  set 

If  the  ordering  >  is  total  it  is  linear  in  both  the  size  of 
the  transformed  formula  and  the  size  of  the  truth  value  set 

Because  classical  Horn  SAT  is  solvable  in  linear  time  [2], 
we  obtain  the  following  corollaries  (for  distributive  lattices 
similar  results  were  proven  in  [16]): 

Corollary  14.  Regular  Horn  SAT  can  be  solved  in  time  lin¬ 
ear  in  the  size  of  the  formula  and  quadratic  in  the  size  of  the 
truth  value  lattice. 

Corollary  15.  For  all  fixed  truth-value  lattices  (A,  >),  reg¬ 
ular  Horn  SAT(Ni>)  can  be  solved  in  linear  time. 

In  the  special  case  of  totally  ordered  truth  values,  regular 
Horn  SAT  is  of  even  smaller  complexity  (which  was  already 
known,  see  [7]). 

Corollary  16.  Regular  Horn  SAT  with  a  totally  ordered  set 
of  truth  values  can  be  solved  in  time  linear  in  both  the  size 
of  the  formula  and  the  size  of  the  truth  value  set. 

4.3  Regular  Unit  Resolution 

In  this  subsection,  we  define  a  regular  unit  resolution  cal¬ 
culus  and  state  its  completeness  for  regular  Horn  clauses. 
The  calculus  is  based  on  the  inference  rules  shown  in  Ta¬ 
ble  1. 

Theorem  17.  A  regular  Horn  formula  T  is  unsatisfiable  if 
and  only  if  there  exists  a  derivation  of  the  empty  clause 
from  T  using  the  calculus  formed  by  the  PRUR  rule  and 
the  RR  rule. 

Proof  sketch.  Theorem  12  states  that  T  is  satisfiable  iff  T*  is 
satisfiable.  Thus,  we  know  that  T  is  unsatisfiable  iff  there 
exists  a  derivation  of  the  empty  clause  from  T*  using  classi¬ 
cal  positive  unit  resolution  (PUR),  since  this  rule  is  refuta¬ 
tion  complete  for  classical  Horn  formulae.  One  proves,  by 
induction  on  the  number  n  of  deduction  steps  in  that  deriva¬ 
tion  using  one  of  the  additional  clauses  that  are  in  T*  but  not 
in  T,  that  it  is  possible  to  construct  from  a  classical  (PUR) 
derivation  of  the  empty  clause  from  T*  a  deduction  of  the 
empty  clause  from  T  using  the  PRUR  and  RR  rules.  □ 


Our  regular  reduction  rule  can  be  seen  as  an  improve¬ 
ment  of  the  rule  presented  in  [8].  Whereas  they  provide  a 
top-down,  Prolog-like  proof  procedure,  we  have  defined  a 
bottom-up  procedure  based  on  unit  resolution.  An  alternate 
solution  with  an  extended  notion  of  signs  that  avoids  reduc¬ 
tion  rules  altogether  can  be  found  in  [9].  In  [8, 9],  however, 
complexity  issues  are  not  discussed. 

4.4  Infinite  Truth  Value  Lattices 

The  results  of  this  section  so  far  have  only  been  proven 
for  finite  truth  value  lattices;  for  example,  it  is  essential  for 
Lemma  1 1  to  hold  that  the  set  of  truth  values  is  finite. 

Nevertheless,  the  results  apply  in  many  cases  to  infinite 
truth  value  lattices  as  well,  because  it  suffices  to  consider 
the  sub-lattice  that  is  generated  by  the  truth  values  actually 
occurring  in  a  formula  and  the  bottom  element. 

Definition  18.  Given  a  regular  Horn  formula  T  over  a  (pos¬ 
sibly  infinite)  truth  value  lattice  (A,  >),  we  define  (Nr,  >) 
to  be  the  sub-lattice  of  (A,  >)  generated  by  the  elements  in 

{i  e  N  |  i  occurs  in  T}  U  {±}  . 

The  following  theorem  states  that  if  the  satisfiability  of 
a  formula  T  is  to  be  checked,  it  suffices  to  only  consider 
the  truth  value  lattice  (Nr,  >).  Thus,  if  Nr  is  finite  and 
effectively  computable  for  all  T,  then  all  previous  results  of 
this  section  can  be  made  use  of  by  considering  the  lattice 
(Nr,  >)  instead  of  (A,  >). 

Theorem  19.  Let  T  be  a  regular  Horn  formula  over  a  ( pos¬ 
sibly  infinite)  truth  value  lattice  (A,  >).  The  formula  T  is 
satisfiable  by  an  interpretation  over  the  lattice  (A,  >)  if  and 
only  if  it  is  satisfiable  over  the  lattice  (Nr,  >). 

Proof.  The  if-part  of  the  theorem  is  trivially  true,  because 
every  interpretation  over  (Ar,  >)  is  an  interpretation  over 
(A,  >)  as  well. 

To  prove  the  only-if  part,  assume  that  the  A-interpreta- 
tion  I  satisfies  T.  Define  the  Ar -interpretation  Jr  for  all 
atoms  p  e  S  by  Jr(p)  =  U  where 

Mp  =  {i  €  Ar  |  I(p)  >  i}  . 

It  suffices  to  show  that  for  all  truth  values  i  occurring  in  T 
(and  thus  in  Ar):  I  \=fi:p\f  and  only  if  Jr  |=  t  i  :  P • 

a.  Assume  that  1 1=  f  i  :p,  i.e.,  I(p )  >  i.  In  that  case, 
i  £  Mp  and,  by  definition  of  Jr,  we  have  Ir(p)  >  i  and, 
thus,  Jr  [=  t*  :P- 

b.  Assume  that  Jr  \=  f * :  P>  h(p)  >  i-  Since  I(p )  >  j 
for  all  j  €  Mp,  we  have 

I(p)  >\_\MP  =  Ir(p)>i 

and,  thus,  I  \=  f  i :  p  (note  that  the  supremum  operator  U  is 
the  same  in  both  lattices).  □ 
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Positive  Regular  Unit  Resolution  (PRUR) 

->  t i-P 

tn  -P,---Ah  ■ Pk  ->  t  j'Q 

t*i  :Pi) •  •  - > t »i— i  -Pi- I,t*i+1  -Pi+1,..-Aik -Pk  ->  1 3-Q 
provided  that  i  >  ii. 


Regular  Reduction  (RR) 

->  t*  : P 
-»•  ti 

-»•  t  (*  U  i)  :  p 

provided  that  neither  i  >  j  nor  j  >i. 


Table  1.  Inference  rules  of  the  regular  unit  resolution  calculus. 


Since  the  formula  T  is  finite,  the  set  of  elements  gener¬ 
ating  (Nr,>)  is  finite  as  well.  Therefore,  the  sub-lattice 
(Nr,  >)  is  finite  if  (N,  >)  is  locally  finite,  i.e.,  if  every  sub¬ 
lattice  generated  by  a  finite  subset  is  finite.  This  is,  for  in¬ 
stance,  the  case  if  the  lattice  (Nr,  >)  is  distributive. 

4.5  Extension  to  Partial  Orders  with  Maximum 

One  of  the  main  advantages  of  our  transformational  ap¬ 
proach  to  signed  logic  is  that  it  becomes  completely  trans¬ 
parent  which  additional  deductive  machinery  is  required  as 
compared  to  the  classical  case.  This  becomes  clearer  even 
when  we  go  beyond  lattice- ordered  regular  Horn  formulae. 

We  start  with  two  considerations  that  somewhat  limit  the 
terrain.  A  core  feature  of  any  efficient  deduction  procedure 
for  Horn  formulae  is  the  possibility  to  represent  the  con¬ 
junction  of  two  unit  clauses  as  a  single  unit  clause  as  wit¬ 
nessed  by  the  reduction  rule  in  the  previous  section.  This 
amounts  to  saying  that  signs  of  atoms  must  be  closed  under 
conjunction.  When  signs  are  upsets  this  condition  can  be 
expressed  as: 

For  all  i,j  €  N  there  is  a  k  G  N  such  that 
tinfj  =  f*  (  } 

Proposition  20.  Every  non-empty ;  finite  poset  that  satisfies 
(J)  is  an  upper  semi-lattice. 

Therefore,  it  is  inevitable  to  generalise  the  language  of 
signs  if  we  want  to  go  beyond  lattices.  A  natural  candidate 
for  an  enriched  language  of  signs  are  finite  unions  of  upsets 
which  can  also  be  seen  as  finitely  generated  filters.  In  the 
following  we  write  f  {i\ , . . . ,  i*  }  instead  of  t  i\  U  ■  •  ■  U  t  ik 
and  similar  for  4*.  We  extend  our  notion  of  regularity  (and 
hence  of  Horn  formulae)  as  follows: 

Definition 21.  If  a  sign  S  is  of  the  form  f  {<i, . . .  ,ik)  or 
4.{ii, for  some  {ii, . . .  ,**■}  C  N  and  k  >  1,  then 
it  is  called  a  regular  sign. 

A  signed  clause  is  called  regular  if  it  contains  regular 
atoms  with  signs  only  of  the  form  t  {*i?  -  *  ■  A  sig¬ 

ned  CNF  formula  is  called  regular  if  it  only  contains  regu¬ 
lar  clauses.  A  regular  clause  containing  at  most  one  regular 

3 The  remark  made  at  the  end  of  Section  2.2  applies  here  as  well. 


atom  with  positive  polarity  is  a  regular  Horn  clause.  A  reg¬ 
ular  CNF  formula  consisting  solely  of  regular  Horn  clauses 
is  a  regular  Horn  formula. 

The  next  question  is  which  partial  orders  can  be  cap¬ 
tured  if  we  want  to  retain  an  efficient  decision  procedure 
for  regular  Horn  formulae.  One  such  necessary  condition 
is  that  there  must  be  a  maximum  T.  To  see  this  consider 
N  =  {false,  true}  with  false  £  true  and  true  false. 
In  this  case  {false}  =  f  { false }  and  {true}  =  f {true}, 
so  each  classical  CNF  clause  can  be  expressed  as  a  regular 
Horn  clause.  Given 

C  =  pi,...,pk  , 

rewrite  C ,  for  example,  into: 

t {true}: px,..., ^ {true}  :pk, 

t  {false}  :  <?i , . . . ,  t  {false}  :  ®_i  -»  f  {true}  :  qt 

Hence,  we  cannot  expect  to  obtain  a  polynomial  decision 
procedure  for  such  Horn  formulae.  The  problem  is  that  by 
conjoining  regular  signs  f  false  and  f  true  we  can  express 
falsity  at  any  time  on  the  object  level  which  is  as  good  as  to 
admit  contrapositives  of  clauses. 

Finally,  we  sketch  how  partial  orders  with  maximum  lead 
to  a  reasonable  notion  of  generalised  Horn  formulae.  This 
can  be  done  via  a  reduction  to  lattice-ordered  Horn  formulae 
handled  in  the  previous  sections.  For  a  partial  order  (N,  <) 
with  maximum  T  consider  the  lattice  T~*~(N)  of  its  non¬ 
empty  order  filters.  Its  elements  can  be  represented  as  the 
non-empty  anti-chains  of  ( N ,  <),  that  is 

{S  |  0  ^  S  C  N,  and 

for  all  i,j  £  S:  if  i  ^  j  then  i  <£.  j,  j  ^  i}  . 

The  order  C  on  Jr+( N )  is  defined  as  5  □  S'  iff  ftS  D 
it  S',  where  if  S  is  the  filter  generated  by  S  in  N.  To  apply 
the  results  of  the  previous  sections  it  is  sufficient  to  show: 

Proposition 22.  A  regular  literal  fS  :  p  is  satisfiable  w.r.t. 
a  poset  with  maximum  ( N ,  <)  iff  it  is  satisfiable  w.r.t.  the 
lattice  F+(N). 4 

4In  the  latter  case,  of  course,  5  is  interpreted  as  a  single  lattice  element 
in  T+(N). 
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There  is  a  price  to  pay  for  the  increased  generality:  The 
lattice  T+(N)  can  be  considerably  larger  than  the  poset 
(jV,  <),  in  the  worst  case  exponentially  larger.  This  proves: 

Theorem  23.  Regular  Horn  SAT  formulae  based  on  posets 
with  maximum  can  be  solved  in  time  linear  in  the  size  of  the 
formula  and  exponential  in  the  size  of  the  truth  value  set. 

Example  3,  Consider  the  poset  N  depicted  below  on  the 
left.  The  lattice  jF+{N)  is  shown  on  the  right.  It  can  be 
seen  as  a  lattice-completion  of  N. 


{hj} 


From  a  deductive  point  of  view  it  is  important  to  com¬ 
pute  the  supremum  LJ  and  C  in  (N),  because  these  are 
required  in  the  reduction  and  unit  resolution  rule,  respec¬ 
tively.  This  is  done  as  follows: 

Let  ftS  =  ft{ii,...,i*}  and  ft  S'  =  ft  { Ji ,  — ,  Ji  > 
be  given.  We  denote  with  max(i,j)  the  set  of  minimal 
elements  above  i  and  j  in  N  w.r.t.  <.  Now  S  LI  S'  = 
ft  Soft  S'  =  {k  |  k  €  max(ij'),  i  e  S,  j  C  S'}.  From  the 
resulting  set  any  elements  not  minimal  in  it  can  be  deleted 
to  arrive  at  an  anti-chain  representation.  Finally,  S  C  S'  iff 
ft  5  D  ft  5'  iff  for  all  jr  6  Sf  there  is  a  is  €  S  such  that 

is  <  Jr- 

5  Future  Work 

An  investigation  of  the  lattice  theoretic  aspects  of  lattice- 
ordered  regular  Horn  formulae  could  lead  to  useful  new  re¬ 
sults.  In  particular,  in  the  infinite  case,  for  which  only  first 
ideas  have  been  presented  in  Section  4.4,  representation  the¬ 
ory  and  dualities  should  be  further  studied.  Priestley  duality 
has  already  been  successfully  exploited  in  the  case  of  dis¬ 
tributive  lattices  [15]. 

As  another  line  of  work,  experiments  should  be  carried 
out  to  compare  our  tailored  decision  procedures  for  regular 
Horn  formulae  with  procedures  for  classical  Horn  formulae 
after  applying  the  transformation  *  defined  in  Section  4. 
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Abstract 

The  study  of  semirigid  sets  arose  from  the  classification 
of  bases .  In  this  complex  problem  -  fully  solved  only  for 
\A\  =  2,3-  one  of  the  tasks  is  to  find  all  minimal  nontriv¬ 
ial  intersections  of  systems  of  maximal  clones.  Most  of  the 
clones  are  determined  by  reflexive  relations  (binary  or  of 
higher  arities)  and  so  we  need  to  determine  subsets  R  of 
these  relations  such  that  every  function  preserving  all  rela¬ 
tions  in  R  is  either  constant  or  is  a  projection. 

In  this  paper  we  give  a  short  overview  of  this  problem  for 
I)  isotone  relations,  2)  central  relations  and  3)quasi-linear 
relations.  Finally  we  add  some  new  results  for  4)  autodual 
clones. 


1.  Introduction 

In  the  sequel  the  universe  A  is  a  fixed  nonempty  set.  For 
a  positive  integer  n  denote  by  the  set  of  all  n-ary  op¬ 
erations  or  functions  on  A  (i.e.  maps  /:  An  A)  and 
put  Oa  •=  IXLi  0$°.  For  example,  for  1  <  z  <  n 
the  n-ary  operation  e”  on  A,  called  the  z-th  n-ary  projec¬ 
tion,  is  defined  by  setting  ef  (ai, . . . ,  an)  :=  a*  for  all 
ai} . . . ,  an  £  A.  Roughly,  a  clone  A  is  a  composition 
closed  subset  of  Oa  containing  the  set  Ja  of  all  projections, 
or,  equivalently,  the  set  of  term  operations  of  a  universal  al¬ 
gebra  {A;  F)  (where  F  C  Oa)-  For  a  more  precise  defini¬ 
tion  cf.  [8]  or  [20].  For  example,  the  sets  Ja  and  O a  are 
clones.  Also  the  set  Ka  consisting  of  all  projections  and  all 
constant  operations  (i.e.  /  G  O ^  with  \im  /|  =  1,  i.e.  tak¬ 
ing  a  single  value)  is  a  clone.  The  clones  on  A ,  ordered  by 
inclusion,  form  a  lattice  La  with  the  least  element  JA  and 
the  greatest  element  Oa  in  which  the  meet  of  any  family  of 
clones  is  their  set  theoretic  intersection.  Let  A  be  finite  and 
\A\  >  2.  Then  \LA\  -  2*°  [3]  and  the  lattice  is  largely  un¬ 
known.  The  dual  atoms  (=  coatomes,  i.e.  elements  covered 
by  O a  ),  called  maximal  or  precomplete  clones,  are  explic¬ 
itly  known.  Moreover,  each  proper  clone  is  a  subclone  of  a 


maximal  clone.  For  their  description  we  need  the  following 
concept. 

Let  h  be  a  positive  integer  .  A  subset  p  of  Ah  (i.e.  a 
set  of  /z-tuples  over  A)  is  an  /z-ary  relation  on  A.  An  n-ary 
/  G  Oa  preserves  p  if  for  every  h  x  n  matrix  X  =  [xif] 
over  A  whose  columns  are  all  h- tuples  from  p  we  have 

(/(#  11  j  •  •  • )  •Pin))  21  j  •  •  *  i  %2 n)>  •  *  *  >  f{xfil  i  •  •  •  >  C  P 

(i.e,  p  is  a  subuniverse  of  the  h- th  power  (A;  f)h  of  (A;  /)). 
For  example,  if  p  is  a  (partial)  order  <  (i.e.  a  binary  re¬ 
flexive,  antisymmetric  and  transitive  relation)  then  /  pre¬ 
serves  <  exactly  if  /  is  isotone,  i.e.  if  f(xn , . . . ,  %in)  < 
f(x 21, . .  .,*2n)  whenever xn  <  x2i, . .  .,x\n  <  x2n •  The 
set  of  all  f  E  Oa  preserving  a  given  relation  p  is  denoted  by 
Pol  p.  There  are  6  well-described  finite  families 
of  relations  on  A  such  that  {Pol  p  :  p  G  Hi  U  . . .  U  Re} 
are  precisely  the  maximal  clones  (cf.  [20]).  The  numbers 
of  maximal  clones  are  known  (cf.  [19]).  The  most  rich 
families  are  those  that  preserve  order  relations  and  central 
relations.  A  remarkable  fact  is  that  the  intersection  of  ail 
maximal  clones  is  the  least  element  Ja  of  La  .  Call  an  ft-ary 
relation  on  p  on  A  reflexive  if  (a, . . . ,  a)  G  p  for  all  a  £  A. 
The  nonreflexive  relations  in  71  :=  IZi  U . . .  U  Re  are  all  the 
unary  relations  p  (i.e.  subsets  of  A)  such  that  <j>  ^  p  A 
and  relations  of  the  form  {(a,  7 r(a))  :  a  £  A}  where  7r  is 
a  special  permutation  on  A  (investigated  in  Sections  4).  It 
is  immediate  that  KA  C  Pol  p  for  every  reflexive  relation 
p  on  A.  Call  a  set  {pi  :  i  £  1}  of  reflexive  relations  on 
A  semirigid  if  I\A  =  Hie/  Po1  The  fluestion  is  t0  de_ 
termine  semirigid  subsets  of  R.  A  unary  operation  f  on  A 
(i.e.  a  selfmap  of  A)  is  a  joint  endomorphism  of  a  set  R 
of  relations  on  A  if  /  G  O^  n  f  )peR  Pol  p.  The  set  of 
joint  endomorphism  of  R  is  denoted  by  End  R.  It  is  known 
(cf.  Lemma  ??  below)  that  R  is  semirigid  if  and  only  if  End 
R=K\  =  O^}  H  K.  Apparently,  the  first  known  example 
of  a  semirigid  set  of  relations  is  the  set  Oa  of  all  (binary) 
equivalence  relations  on  A  (cf.  [15,  Ch.l,  Probl.  2,p.38]). 
Two  clones  are  of  a  same  type  if  the  relations  they  preserve 
are  isomorphic.  It  is  natural  to  consider  semirigidity  first 
within  a  type.  As  there  are  variety  of  types,  we  shortly 
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overview  semirigid  sets  of  relations  on  A.  As  a  special  case 
(also  for  simplicity)  a  relation  p  is  semirigid  (C-rigid  [5]  or 
endorigid  [17])  if  End  p  =  K^\ 

2.  Reduction  to  unary  functions 

Let  C  be  a  clone  on  the  universe  A.  Call  := 
CnO(1>  (i.e.  the  set  of  unary  operations  from  C)  the  foun¬ 
dation  of  C  (hereafter  the  index  a,  denoting  the  universe,  is 
sometimes  omitted;  e.g.  denotes  O^). 

The  following  lemma  is  from  [5],  Prop.  2.2  and  also 
follows  from  [14]. 

Proposition  1.  Let  |A|  >  2.  Then  is  the  foundation  of 

a  clone  C  on  A  if  and  only  ifC  =  K. 

The  statement  is  not  true  for  |A|  =  2  (take  e.g.  A  = 
{0, 1}  and  the  clone  generated  by  the  constants  and  the  con¬ 
junction  A).  This  lemma  reduces  the  semirigidity  problem 
of  many-valued  logic  to  the  foundations  of  their  clones. 

This  lemma  was  strengthened  by  replacing  K  by  I<h-i, 
where  Kh-i  :=  {/  €  0\  \im  f\  <  h  -  1}  H  J  for  1  <  h  < 
k  :=  |A|  (the  case  h  =  2  reduces  to  Proposition  1)  [10]. 
However,  we  have: 

Proposition  2.  The  following  statement  is  not  true:  for  a 
clone  C  on  A  (\A\>‘1  finite) 

if  C(1)  =  J''V\  =  {e}  identity  map)  then  C  —  J. 

Proof  We  can  take  e.g.  the  clone  generated  by  the  conjunc¬ 
tion  A,  where  x  A  y  :=  min(a:,  y)  with  respect  to  the  natural 
order  0  <  1  ...<&—  1  (where  |A|  =  k).  □ 

In  Section  4  we  formulate  a  special  (weaker)  case  of  this 
statement  as  an  open  problem. 

3.  A  brief  overview 


Order  relations 

For  an  order  <  on  A  the  clone  Pol  (<)  is  maximal  if  and 
only  if  the  order  is  bounded  (=  it  has  a  least  element  o  and  a 
greatest  element  e,  i.e.  o  <  a  <  e  holds  for  all  a  €  A)  [9]. 
The  maximal  clones  of  the  form  Pol  (<)  are  exceptional  in 
the  sense  that  they  are  not  be  necessarily  finitely  generated 
[21].  It  seems  that  the  characterization  of  semirigid  set  of 
orders  is  not  easy.  Three  types  of  orders  has  been  consid¬ 
ered  so  far  1)  the  orders  of  height  1, 2)  the  chains  and  3)  so 
called  the  diamonds  (Fig.  1). 

In  [2]  a  fairly  general  approach  is  taken  successfully  to 
this  problem  in  the  framework  of  order  relations  (not  nec¬ 
essarily  bounded).  They  addressed  the  following  problem: 
What  is  the  least  m  such  that  there  are  orders  <i , . . . ,  <m 


1)  height  1  (fence) 


Figure  1.  Orders 


on  A  having  only  trivial  joint  endomorphism  (i.e.  End 
{<!, . . <m}  =  In  the  paper  semirigid  binary  rela¬ 

tions  p  (i.e.  an  oriented  graph  G  =  (A,  p'j)  are  constructed 
systematically  for  |A|  >  7  or  | -4  =  6.  Based  on  a  semi- 
rigidity  criterion  (Theorem  2.4  [5])  considerable  effort  is 
devoted  to  the  semirigidity  proof  of  the  graphs.  Then  the  p 
is  represented  as  a  (set  theoretical)  union  of  two  orders  < 
and  <'  both  of  height  1  (an  order  (A,  <)  is  of  height  1  if 
each  a  €  A  is  either  minimal  or  maximal).  We  have 

Lemma  3.  Let  R  be  a  set  of  binary  relations  on  a  set  A. 
Then  the  least  set  R*  of  binary  relations  on  A  containing 
R  and  closed  with  respect  to  (i)  arbitrary  (set-theoretical) 
unions  and  intersections  and  (ii)  inverses  satisfies 

End  R  C  End  R* . 

From  this  we  can  conclude 

End  {<,  <'}  C  End  p  =  ■ 

For  example,  for  a  graph  G  on  6  in  Fig.2, 

<  =  {(0, 1),  (2,3),  (4, 5) } U  la  and 

<'  =  {(1,2),(1,4),(3,0),(5,0)}Um 

where  la  =  {(a,a)|a  €  A}  . 

The  general  construction  fails  for  |A|  <  6.  The  excep¬ 
tion  for  |A|  =  7  can  be  fixed  separately.  For  |A|  =3,4 
there  are  no  semirigid  set  of  binary  relations  [1]  and  they 
must  be  considered  separately.  This  may  be  fixed  by  case 
analysis.  For  the  semirigidity  of  chains  and  diamonds  the 
situations  were  settled  completely.  For  |A|  =  3  we  need  at 
least  3  chains  while  for  |A|  >  3  there  are  semirigid  pairs  of 
chains.  For  diamond  orders  such  minimum  number  is  4  for 
|A|  =  4, 5  (for  |A|  =  3  the  diamond  reduces  to  a  chain). 
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3  2 


Figure  2.  A  semirigid  graph  G  (|^4|  =  6) 

In  [11]  the  problem  is  considered  for  chains  (=  linear 
orders).  A  simple  criterion  for  semirigidity  is  given.  For 
A  =  fc  =  {0, 1, . . . ,  k  - 1}  where  k  >  3  there  always  exists 
a  chain  which  together  with  the  natural  order  0  <  1 , . . .  < 
k  -  1  forms  a  semirigid  system.  For  example,  the  chain 
1  ^  3  0  ^  2  is  the  only  chain  such  that  for  k  —  4: 

0  <  1  <  2  <  3  and  1  -<  3  -<  0  -<  2 
form  a  semirigid  system. 

It  is  shown  that  the  proportion  of  such  chains  in  the  total 
of  A?!  chains  tends  to  1/e  =  0.13. .  .as  fc  becomes  large.  In 
[7]  the  problem  is  considered  for  diamond  orders.  An  order 
relation  <a6  is  a  diamond  provided  x  <ab  y  holds  exactly 
if  x  =  a  or  y  =  b.  The  semirigidity  criterion  is  given.  For 
example,  {<oi>  <03,  <12,  <13}  and  {<01  ><23,  <24,  <34} 
are  the  semirigid  systems  for  k  =  1  and  k  —  5,  respectively. 

The  number  of  semirigid  sets  of  diamonds  is  also  given 
explicitly  for  all  k  >  4. 

Central  relations 

For  a  set  of  h- ary  central  relations  (1  <  h  <  k  :=  |A|  > 
2)  the  semirigidity  problem  is  naturally  extended[10]  to  : 

A  set  of  h- ary  central  relations  on  a  set  A  is  called  semi¬ 
rigid  if  the  clones  of  fc-valued  logic  functions  determined 
by  the  relations  share  only  the  clone  Kh-i  consisting  of  all 
projections  and  all  functions  assuming  at  most  h-  1  values 
(1  <  h  <  k  :=  |A|  >  2).  For  h  =  2  this  definition  reduces 
to  the  semirigidity  described  before. 

It  is  shown  that  the  minimum  size  of  a  semirigid  set  of  h- 
ary  central  relations  is  h  -f  1  [10].  The  paper  [12]  has  some 
additional  results,  but  no  systematic  study  has  been  carried 
out. 

Quasi-linear  relations 


In  [13]  the  semirigidity  problem  is  considered  for  the  so 
called  quasi-linear  (or  affine )  clones  for  k  prime.  For  k  =  5 
we  need  at  least  3  such  clones  to  be  semirigid,  while  for 
k  >  5  prime  there  always  exist  a  semirigid  pair  of  clones. 
The  case  k  power  of  a  prime  has  not  yet  been  investigated 
(the  notion  of  quasi-linear  functions  becomes  complicated). 

4.  Autodual  clones 

For  areflexive  relations  the  semirigidity  problem  reduces 
to  the  rigidity  problem  because  they  do  not  admit  any  con¬ 
stant  functions  (thus,  a  set  {pi  :  i  G  1}  of  areflexive  re¬ 
lations  on  A  is  rigid  if  Ja  =  Die/  ^  P*)-  The  situa¬ 
tion  seems  to  be  much  simpler.  We  investigate  it  here.  Set 
A  :=  k  :=  {0, . . . ,  k  -  1}  and  let  S*  denote  the  symmetric 
group  over  k. 

For  a  permutation  s  G  Sk  set 

s°  =  {(x,s(a:))|a;  G  k}. 

The  operations  of  the  clone  Pols0  is  called  autodual  with 
respect  to  s.  It  is  known  that  Pol  s°  is  maximal  if  and  only 
if  the  permutation  s  is  the  product  of  i  cycles  of  the  length 
p,  where  k  —  p  ■  l  and  p  is  a  prime  number.  In  this  section 
we  fix  such  a  permutation  s. 

For  s  G  Sk  call  {s,  s2, . . . ,  sk},  where  sk  =  e  orbit  of 
s.  Two  permutations  are  orbit-free  if  their  orbits  share  only 
the  identity  function  e. 

The  following  lemma  is  known  (cf.  [6]): 

Lemma  4.  For  s,  s'  G  Sk  we  have  Pol  s°  =  Pol  s  °  if  and 
only  if 

s'  =  s'  for  some  i  G  {1,  •  •  •,  k  —  1}; 

in  other  words,  two  permutations  yield  distinct  autodual 
clones  if  and  only  if  they  are  orbit-free. 

For  fc  =  £  ■  p  (p  prime,  l  >  1)  and  k  = 

{(11,0,  •  •  •,  al,p-l>  a2,0,  •  •  •  ,<J2,p-l)  •  •  •!  at,<h  •  •  •  )  al,p- 1} 

let 

s(a.,r)  =  Ui.r+1  (mod  p)  (*  =  1 ,  •  •  • ,  f  i  r  =  0,  •  •  •  ,  P  ~  1)^ 

This  gives 

^ (u»',r)  (mod  p))  j  0,  1,  ... . 

As  elements  of  {ail0,...,a,>-i}  (*-th  component)  are 
shifted  cyclically  by  s,  we  may  assume  a», o  = 
min{atio,  •  •  •  ,a»,p-i}- 

From  the  definition  of  s°  a  unary  function  /  is  autodual 
with  respect  to  s  if  and  only  if 

«(/(*))  =  /(*(*))•  (2) 
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This  yields 

/(<v)  =  sr(/(a», o))  (r  =  0,. .  .,p-  l;i  =  1,..  .,£)■ 

(3) 

To  represent  a  unary  function  /  t  Pol  sD ,  let  us  call  the 
set  {a,,o,  •  •  • ,  i-th  cycle  of  /  and  call  a  restriction 

of  /  to  the  i-th  cycle  (a  map  from  the  i-th  cycle  into  fe) 
i-ih-component  of  /.  Denote  the  i-th  component  of  /  by 

•  •  •  j  Q>i,p— l 

.  /(a.'.o),  •  ••,  /(«i,P-i)  . 

We  have  the  following  component-wise  representation 
for  a  unary  autodual  function  /. 

Theorem  5.  A  unary  function  f  is  autodual  with  respect  to 
s  if  and  only  if  its  i-th  component  is  of  the  form 

j  •  •  «  y  ^i}p—  1 

aji,ri  j  •  •  •  j  aji,ri+p- 1  (mod  p)  m 

for  each  i  G  {1, . . . ,  £},  where  ji,  ■  ■  ■ ,  ji  €  {1, . . . ,  and 
n , . . . ,  rt  G  {0, . . . ,  p  —  1 }.  In  other  words,  f  should  map 
every  cycle  onto  a  cycle  preserving  s. 

Proof  (<^)We  show  that  /  defined  by  /(a,-,m)  := 
aji}n+m  (mod  P)  for  each  component  i  G  {1, . . . ,  t)  and 
m  €  {0, . . . , p  -  1}  satisfies  (2).  Indeed,  by  (1)  s(a,,m)  = 

ai,m- f  1  (mod  p)»  SO 

f{s{ai,m))  =  aji,ri+m+ 1  (mod  p) 

=  5(aji,ri+m  (mod  p))  = 

(=>)  Assume  that  /satisfies  (2).  Let  1  <  i  <  £  and 
f(ai)0)  =  cijiSi  for  some  ji  G  {1,...,^}  and  ri  G 

{0, _ —  1}-  Then  by  (3)  clearly  /(a;,r)  =  aji)rt+r  and 

file  i-th  component  of  /  has  the  required  form.  □ 

We  have  the  following  corollaries: 

Corollary  6.  The  number  of  unary  functions  that  autodual 
with  respect  to  s  is 

|(Pol  sa)M\  =  kl.  (4) 

Proof  We  have  l  choices  for  j,  and  p  choices  for  r,,  and 
this  can  be  done  for  each  i  =  1  □ 

Corollary  7.  The  number  of  permutations  on  k  autodual 
with  respect  to  s  is 

|(PolsD)^1,onto^|  —  t\pl .  (5) 

Proof  The  only  difference  is  that  the  choice  for  ji  should  be 
such  that  ji , . . . ,  ji  should  be  a  permutation  of  { 1 , . . . ,  £ } . 
So  we  have  a  total  of  £\  choices  for  the  set  {ji, . . .  ,jt}. 
For  rj  the  situation  remains  the  same  as  in  the  preceding 
corollary.  □ 

From  Theorem  5  we  have  the  following  results  on  the 
rigidity  problem  for  autodual  clones. 


Corollary  8.  For  k  prime  the  unary  functions  of  every  pair 
of  autodual  maximal  clones  shares  exactly  the  identity  func¬ 
tion  e. 

Proof  By  Theorem  5  the  set  of  unary  operations  of  Pol  sa  is 
exactly  its  orbit  {s,  s2, . . . ,  sp}.  By  Lemma  4  distinct  max¬ 
imal  clones  are  induced  exactly  by  orbit-free  permutations. 
Thus  they  share  exactly  the  identity  function  e.  □ 

The  following  result  is  also  immediate  from  Theorem  5. 

Corollary  9*  Let  two  permutations  s  and  sf  share  a  compo¬ 
nent  Then  {sD,  sD}  is  non-rigid. 


However,  the  next  example  shows  that  the  converse  is  not 
true;  i.e.  it  is  not  sufficient  for  rigidity  that  s  and  s'  share  no 
component. 

Example  1.  Let  s\  and  S2  be 


It  can  be  verified  that  s3  =  \l)  belongs  to  Pols?  n 

Pol  s° . 

The  following  example  shows  that  even  two  permuta¬ 
tions  with  different  prime  cycle-lengths  may  share  a  non¬ 
trivial  unary  function. 


Example  2.  Both  clones 


admit  the  non-trivial  unary  function 


012345 

453201 


We  have  considered  unary  functions  of  autodual  clones. 
We  formulate  some  open  problems  in  order  to  solve  the 
rigidity  problem. 

Let  P  denote  the  set  of  prime  divisors  of  fc.  Denote  by  S 
the  set  of  all  permutations  of  fe  =  {0, . . . ,  k  -  1}  with  k/p 
cycles  of  length  p  where  p  runs  through  P. 

Question  1.  Suppose  that  si , . . . ,  sm  G  E  are  such  that 
(n*LiPols?)(1)  =  {e}.  Is  it  true  that  Pols?  —  J  (= 
the  clones  of  all  projections)? 

If  the  answer  is  yes,  then  Corollary  8  means  that  ev¬ 
ery  autodual  maximal  clones  is  rigid  for  k  prime.  One 
should  start  looking  at  k  =  5.  Indeed  for  k  =  3  there 
is  only  one  clone  PolsD  with  s  G  E.  For  k  =  4  the 
set  E  is  {si,S2,s3}  given  in  Example  1  whereby  s*  G 
Pol  si  □  H  Pol  n  Pol  s3D  (i  =  1,2,3)  and  therefore 
{sD  | s  G  E}  is  not  rigid. 

Question  2.  Characterize  rigid  {s?,...,s°}  with 
Si , . . . ,  sm  G  E. 

Again,  one  should  start  looking  at  cyclic  orbit-free  per¬ 
mutations  of  5. 
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5.  Conclusions 

We  briefly  overviewed  the  semirigidity  problems.  We 
conjectured  that  autodual  clones  are  rigid  to  each  other  for 
k  prime.  Unfortunately,  we  could  not  solve  the  problem 
completely.  Instead,  we  listed  some  open  questions. 

For  the  constant-preserving  maximal  clones  (unary  rela¬ 
tions  corresponding  to  a  non-empty  proper  subset  0  f  p  ^ 
A),  the  trivial  Ar-system  of  relations  {{()}. . . . ,  {A-  —  1}}  is 
rigid  and  k  seems  to  be  the  minimum  number  of  cardinality 
for  such  a  system. 

Several  types  of  relations  can  be  investigated  further. 
Among  them  we  include  /i -regular  relations  (the  only  re¬ 
maining  family  among  U  that  is  not  investigated  yet).  Semi¬ 
rigidity  among  different  types  of  relations  could  be  investi¬ 
gated  systematically. 

Acknowledgement.  The  author  is  grateful  to  Prof.  I.G. 
Rosenberg  for  his  valuable  suggestions.  The  author  bor¬ 
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Abstract 

With  the  growing  interest  and  the  emergence  of  various 
implementations  of  Multiple-Valued  logic  (MVL)  circuits, 
testability  issues  of  these  circuits  are  becoming  crucial 
Fault  characterization  is  an  early  step  in  the  test  genera¬ 
tion  process.  It  is  aimed  at  finding  fault  models  that  best 
describe  the  possible faults  expected  to  occur  in  a  given  class 
of  circuits  or  technology.  Layout  and  device  level  studies 
on  CMOS  and  BiCMOS  circuits  revealed  that  the  stuck-at 
model  is  not  adequate  to  represent  the  actual  physical  de¬ 
fects.  In  this  paper,  our  aim  is  to  characterize  faults  in  a 
CMOS  functionally  complete  set  of  MVL  operators.  The 
set  has  been  implemented  using  existing  standard  binary 
CMOS  technology.  This  enables  us  to  characterize  faults 
in  these  operators  using  the  same  techniques  used  for  stan¬ 
dard  binary  CMOS.  Fault  categories  in  MVL  circuits  and 
recommendations  for  testability  will  be  given. 


1  Introduction 

It  has  been  recently  possible  to  implement  MVL  circuits 
using  the  available  binary  technologies,  such  as  the  standard 
CMOS  technology.  This  resulted  in  various  MVL  imple¬ 
mentations  either  as  stand  alone  circuits,  e.  g.  multiplier 
chips,  or  as  modules  integrated  with  binary  circuits  [5,6]. 

With  the  increasing  number  of  MVL  implementations, 
testing  such  circuits  is  becoming  crucial.  It  is  more  involved 
to  test  an  MVL  circuit  as  compared  to  its  binary  counter¬ 
part.  This  is  due  to  the  increased  number  of  logic  values  and 
the  need  to  distinguish  and  eliminate  an  increased  number 
of  logic  choices.  However,  during  the  test  generation  pro¬ 
cess,  there  exists  some  flexibility  since  decisions  to  assign  a 
certain  value  to  a  line  are  not  restricted  to  0  or  1  only. 


Previous  work  on  MVL  testing  has  been  primarily  based 
on  the  stuck-at  fault  model.  The  adequacy  of  this  model  was 
questioned  by  various  studies.  For  this  reason,  and  due  to  the 
increased  complexity  of  test  generation  of  MVL  circuits,  it  is 
very  important  to  recognize  the  actual  type  of  faults  expected 
to  occur  in  MVL  implementations.  This  characterization 
becomes  an  important  input  to  any  test  generation  algorithm. 

In  this  paper,  we  study  the  issue  of  fault  characterization 
and  testability  considerations  in  CMOS  MVL  circuits.  The 
paper  is  organized  as  follows.  Section  2  introduces  back¬ 
ground  material  and  the  considered  MVL  set.  Section  3 
describes  the  fault  characterization  methodology  used.  In 
Section  4,  characterization  results  in  the  MVL  sets  are  pre¬ 
sented.  Testing  for  the  characterized  faults  is  derived  in 
Section  5 .  Recommendations  for  testability  of  MVL  circuits 
are  presented  in  Section  6.  The  paper  ends  with  concluding 
remarks  and  future  work. 

2  Background  Material 

In  this  paper,  we  use  a  functionally  complete  set  of  MVL 
operators  [5,  6].  This  set  has  been  implemented  using 
CMOS  and  consists  of  five  operators:  literal ,  complement 
of  literal ,  cycle ,  tsum  and  min .  It  has  been  shown  that  the 
introduced  set  is  more  efficient  in  realizing  MVL  functions 
in  terms  of  the  number  of  operators  used  and  the  overall 
required  circuits  [6].  The  functionality  of  the  set  is  verified 
by  HSPICE  simulation  [5].  We  will  use  the  reported  set  re¬ 
alizations  to  study  the  possible  types  of  faults  that  can  occur 
at  the  end  of  a  production  line.  The  methods  used  to  model 
possible  faults  in  binary  CMOS  studies  will  be  applicable 
to  the  above  MVL  realizations  since  they  are  built  using  the 
available  binary  CMOS  technology. 

Let  R  —  {0, 1, 2, ...,  r  -  1}  be  the  set  of  logic  values 
for  an  r-valued  logic  where  r  is  the  radix.  Assume  also 
that  a,  6,  E  R,  a  <  b  and  k  E  {1,2, ...,  r  -  1}.  The  MVL 
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operations  used  in  this  paper  are:  min,  tsum,  literal,  and 
cycle.  The  definitions  of  those  operations  can  be  found 
in  [5]. 

There  have  been  tremendous  efforts  to  understand  and 
characterize  physical  defects  in  ICs  and  their  effect  on  the 
circuit  behavior.  Many  studies  were  conducted  on  faulty 
chips  taken  from  actual  production  lines.  In  general,  to 
find  the  effects  of  defects  on  the  circuit  behavior  (fault 
characterization),  a  defect  is  inserted  in  the  circuit.  Then, 
simulation  is  performed  to  find  the  circuit  behavior  under 
that  defect.  This  process  can  be  done  at  two  different  levels, 
namely  at  the  layout  level  [1, 4]  or  at  the  device  level  [3,8]. 
In  the  first,  a  defect  is  inserted  in  the  layout  of  a  fault  free 
circuit.  The  new  circuit,  with  the  inserted  fault,  is  then 
extracted  from  the  layout.  Next,  the  resultant  circuit  is 
simulated  to  study  its  behavior  under  the  inserted  fault.  In 
the  second  approach,  shorts  and  opens  are  inserted  directly 
to  the  device-level-representation  of  the  fault  free  circuit. 
The  behavior  of  the  resultant  c!  !  ircuit  then  is  studied. 

The  aim  of  the  research  work  in  this  paper  is  to  charac¬ 
terize  faults  in  MVL  circuits  on  the  device  level.  Defects 
will  be  inserted  into  each  circuit  and  simulations,  of  the  re¬ 
sultant  circuits,  will  be  conducted.  The  circuits  behavior 
in  the  presence  of  the  defects  will  be  classified  into  fault 
categories  to  give  statistics  on  the  types  of  faults  expected 
to  be  found  in  these  circuits.  For  this  purpose,  components 
of  the  functionally  complete  set  of  MVL  operators,  intro¬ 
duced  earlier,  will  be  used.  The  methodology  used  for  fault 
characterization  is  explained  in  the  next  section. 

3  Fault  Characterization  Methodology 

In  characterizing  faults  in  the  considered  set,  the  follow¬ 
ing  steps  are  used. 

L  Defects  are  inserted  on  transistor  terminals  (3  shorts 
and  3  opens,  see  Figure  1)  and  considered  one  at  a 
time.  This  will  be  performed  for  each  operator  circuit. 

2.  SPICE  will  be  used  to  determine  the  output  of  the 
circuits  in  the  presence  of  each  inserted  defect. 

3.  Simulation  results  will  be  analyzed  to  provide  statis¬ 
tics  on  faults  and  their  occurrences. 

4.  Recommendations  for  MVL  testability  will  be  made 
based  on  the  characterization  results. 

In  the  simulations,  the  resistance  of  the  inserted  short 
faults  will  be  varied.  Typical  resistance  values  considered 
for  CMOS  technology  vary  from  several  Ohms  up  to  4k 
Ohms.  Shorts  may  occur  with  higher  resistances  but  with 
low  percentage  and  very  high  resistance  value  that  will  not 
affect  the  circuit  behavior  [2, 7, 9].  An  open  is  modeled  as  a 
10  Gig.  Ohms  resistors  in  parallel  with  a  0.00  lpF  capacitor 


between  the  transistor  terminal  and  the  node  it  is  normally 
connected  to.  Figure  1  shows  the  open  and  short  faults  along 
with  their  models. 


Figure  1.  Open  and  short  faults  and  their 
models. 

The  transistors  in  each  circuit  will  be  numbered  arbitrar¬ 
ily  starting  with  Current  Mode  CMOS  (CMCL)  transistors 
first  then  Voltage  Mode  CMOS  (VMCL)  transistors.  An 
open  at  the  gate  of  transistor  number  1,  for  example,  will 
be  referred  to  as  Olg.  Old  and  Ols  will  refer  to  opens  at 
transistor  1  drain  and  source,  respectively.  Slgs  refers  to  the 
inserted  short  between  the  gate  of  transistor  1  and  its  source. 
The  following  terms  and  abbreviations  will  be  used: 

•  SAk  means  the  output  stuck  at  k  under  the  simulated 
fault. 

•  aSAO  faults  in  the  literals  caused  the  value  of  a  in 
ka  [x]&  to  be  as  if  it  was  0.  For  example,  if  the  original 
circuit  is  kl  [x]x,  the  resultant  circuit  will  be  k°[x] 1 . 

•  bS A3  faults  convert  the  circuit  to  ka[a:] 3 .  This  has  no 
effect  if  the  b  in  the  original  circuit  was  3. 

•  Para.k  fault  refers  to  a  parametric  change  of  the  value 
of  k .  This  value  is  supposed  to  be  the  same  for  the 
whole  input  range  in  [a,  b].  Under  this  fault,  k  takes 
several  current  levels,  depending  on  the  input  value, 
which  may  be  read  as  different  logic  values.  For 
example,  if  the  circuit  3 1  [a?]3  undergoes  this  fault,  the 
values  of  the  output  current  may  be  different  from 
60uA  (the  correct  value)  for  different  inputs.  For 
example  it  could  be  15,  40,  60  uA  for  x  =  1,  2  and 
3  respectively.  This  implies  that  even  if  the  circuit 
shows  the  correct  output  for  some  input  values  in  the 
[a,  b]  range,  it  could  be  wrong  for  others.  This  implies 
that  the  output  should  be  checked  at  every  possible 
input  value  in  the  range. 
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•  3*  faults:  the  normal  value  of  the  3  logic  level  is 
60uA.  3*  faults  cause  this  value  to  be  higher  than 
60uA.  Typical  examples  found  were  in  the  range  of 
80  up  to  300uA. 

Test  generation  for  these  faults  (in  all  considered  circuits) 
will  be  considered  after  fault  characterization  is  made.  It 
should  also  be  noted  that  all  simulations  will  be  conducted 
on  4-valued  circuits. 

4  Fault  Characterization  of  the  MVL  Set 

The  literal  operation  will  be  selected  as  an  example  to 
illustrate  the  characterization  steps  conducted.  Similar  work 
has  been  carried  out  for  the  other  four  circuits.  Figure  2 
shows  the  literal  implementation,  found  in  [5],  with  the 
transistors  numbered  as  suggested  before. 


Figure  2.  Literal  transistor  diagram. 

Defects  (shorts  and  opens)  will  be  inserted  one  at  a  time. 
The  circuit  will  then  be  analysed  to  find  out  the  resultant 
behavior  in  the  presence  of  the  fault.  This  step  will  be 
repeated  exhaustively  for  all  defects  (3  shorts  and  3  opens 
per  single  transistor)  to  find  resultant  fault  categories  and 
their  statistics. 

Example  1:  When  the  short  Slgs  is  applied  to  the  literal 
circuit,  it  produced  a  stuck  at  0  (SAO)  fault  for  all  inputs.  The 
same  simulation  is  repeated  for  all  possible  configurations 
of  the  literal,  namely  k1^]1,  k1^]2,  k1^]3,  k2[x]29  k2[x]3, 
k3[z]3  for  all  possible  values  of  £  (1,  2  and  3).  The  fault 
produced  the  same  effect,  SAO,  for  all  combinations  and 
thus  will  be  classified  in  the  SAO  fault  category. 

Table  1  summarizes  the  results  obtained  and  the  fault  cat¬ 
egories  identified  for  the  literal  circuit.  Simulation  showed 
that  the  highest  percentage  of  faults,  40.3%,  resulted  in 


stuck  at  0  output  (SAO).  Sequential  behavior  was  observed 
in  33.8%  of  the  cases.  Functional  faults,  19.5%,  change  the 
circuit’s  behavior  into  a  different  function  from  what  it  was 
designed  for.  The  aSAO  is  an  example  of  this  category. 


Fault  Category 
Sequential 

Faults 

Olg,  02g,  03g,  04g,  04s,  04d,  05g,  07g, 

08g,  08s,  08d,  09g,  09s,  09d,  OlOg,  OlOs, 

OlOd,  01  Ig,  01  Is,  01  Id.  0I2g,  OJ2s, 

012d,  013g,  013s,  013d 

Count 

26 

Percentage 

33.8% 

SAO 

Slgs,  Slsd,  S2gs,  S3gs,  S3gd,  S3sd,  S4gs,  S4gd, 
S4sd,  S5gs,  S6gs,  S7gs,  S8sd,  S8gd,  S9gs,  S9gd. 
SlOgs,  SlOgd.  Sllgs,  S12sd,  S13sd,  01s,  Old, 

02s,  02d,  05s,  05d,  06s,  06d,  07s,  07d 

31 

40.3% 

SAk 

S7sd 

1 

1.3% 

Functional 

aSAO 

bSA3 

bSA3  +  para-k 
Comp,  of  literal 

S2sd,  S2gd,  S8gs,  S9sd,Sllsd, 

S12gs,  Sllgd,  S12gd 

S5sd,  03s,  03d,  S13gs 

S13gd,  SlOsd 

S7gd 

15 

19.5% 

Others 

4 

5.2% 

3*  faults 

S5gd.  S6gd, 

S6sd(kSA3*),  06G(sequential*) 

total  77 

Table  1.  Fault  characterization  of  the  literal 
circuit. 


The  same  procedure  was  used  to  characterize  faults  for 
the  other  circuits  in  the  set,  namely:  the  Complement  of 
literal,  the  Cycle,  the  tSum  and  the  Min. 

The  cycle  characterization  results  are  shown  in  Table  2. 


Fault  Category 
Functional 

Faults 

S2sd,  S2gd,  S3gd,  S4gs,  S4sd,  S4gd,  S5gs,  S5gd, 
S5sd,  S6gs,  S6gd,  S6sd,  S7gs,  S7gd,  S7sd,  S8gs, 
S8gd,  S8sd,  SI Igs,  SI Igd,  SI Isd,  Sl2gs,  S12gd, 
S12sd,  S13gs,  SI3gd  S13sd,  S14gs,  S14gd,  S14sd, 
02s,  02d,  03s,  03d,  05s,  05d,  06s,  06d,  07s, 

07d,  08s,  08d,  09s,  09d 

Count 

44 

Percentage 

53.7% 

Sequential 

Olg,  01s,  Old,  02g,  03g,  04g,  04s,  04d, 

05g,  06g,  07g,  08g,  09g,  01  Og,  01  lg, 

01  Is,  01  Id,  01 2g,  012s,  01 2d,  01 3g, 

013s,  013d,  014g,  014s,  014d 

26 

31.7% 

SA3 

SA3 

SA3* 

Slgs,  Slsd,  S2gs,  S3gs 

S3sd,  SlOgd,  SlOsd 

7 

8.5% 

SAO 

S9gs,  S9sd,  SlOgs,  OlOs,  OlOd 

5 

6.1% 

TWfll  82 


Table  2.  Fault  characterization  of  the  cycle 

circuit. 

The  fault  characterization  conducted  on  the  MVL  set  in 
this  work  resulted  in  4  main  fault  categories: 

1.  Sequential  faults  which  have  sequential  behavior. 

2.  SAO:  the  output  is  stuck-at  zero  level. 

3.  SA3:  the  output  is  stuck-at  level  3.  In  the  case  of 
literals,  this  appeared  as  stuck  at  the  literal  value  (the 
value  of  k). 

4.  Functional  faults  which  change  the  circuit  function 
into  a  different  one  from  what  it  was  designed  for. 

In  addition,  some  other  peculiar  faults  were  found  with  low 
percentages.  The  percentages  of  each  category  against  the 
circuits  studied  are  tabulated  in  Tablestat. 
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Literal 

C  Literal 

Cycle 

tSum 

Min 

Sequential 

33.8% 

33.8% 

31.7% 

24.6% 

28.2% 

Functional 

20.8% 

20.8% 

53.7% 

37.7% 

30.8% 

SAO 

40.3% 

6.5% 

6.1% 

7.5% 

33.3% 

SA3 

1.3% 

35.1% 

8.5% 

26.4% 

7.7% 

Others 

3.9% 

3.9% 

3.8% 

Table  3.  Overall  summery. 

5  Testing  of  the  MVL  Set 

For  a  given  circuit  under  test,  the  aim  of  any  test  genera¬ 
tion  process  is  to  find  the  proper  inputs  that  have  their  fault 
free  outputs  different  from  the  faulty  output(s).  For  example 
the  normal  output  for  x1-  ,  when  x  =  0,  is  3.  To  test  for  SAO 
fault  at  the  output,  0  is  a  valid  test  because  if  it  is  applied,  it 
will  produce  a  fault-free  output  of  3  which  is  different  from 
the  faulty  output  (0).  On  the  other  hand,  x  =  1  is  not  a  valid 
test  because  if  it  is  applied,  it  will  produce  a  fault-free  output 
of  0  which  is  not  distinguishable  from  the  faulty  output.  Se¬ 
quential  faults  need  two  test  vectors  since  the  circuit  output 
under  such  faults  depends  on  both  the  previous  state  and  the 
current  input.  One  vector  is  needed  to  initialize  the  circuit 
and  the  second  vector  to  excite  the  fault. 

For  each  of  the  circuits  studied,  test  vectors  were  gen¬ 
erated  by  simulation  of  each  fault  against  all  possible  input 
variations  (a  sequence  of  two  different  inputs  for  sequential 
faults  and  single  input  for  other  faults).  For  the  circuits 
considered,  lists  of  the  valid  tests  for  every  circuit  under 
all  possible  faults  (shorts  and  opens)  were  generated  by  ex¬ 
haustive  simulation.  This  is  possible  since  the  primitives  are 
relatively  small  (maximum  14  transistors  per  circuit).  From 
these  test  lists,  minimal  sets  or  input  sequences  that  test  for 
all  faults  are  extracted. 

The  Literal  tests  are  selected  for  illustration.  Table  4  lists 
the  faults  and  their  tests  for  the  literal  fc 1  [x] 3 .  For  each  single 
fault  (or  two  equivalent  faults),  there  is  one  corresponding 
entry.  Valid  test  vectors  are  listed  along  the  row.  These  tests 
are  obtained  and  verified  by  simulation.  Sequential  faults 
have  their  tests  listed  as  pairs  of  the  initializing  input  value 
and  the  second  value  as  (xl,  x2).  xl  is  the  first  input  which 
should  be  followed  by  x2  to  excite  the  fault.  Functional 
and  other  non-sequential  faults  need  only  one  input  for  test. 
Valid  tests  are  listed  separated  by  commas.  Entries  marked 
with  (correct  output)  are  for  faults  that  do  not  affect  the 
output  and  for  which  the  circuit  behavior  is  the  same  as  that 
of  the  fault-free  circuit.  Para*  entries  require  the  circuit  to 
be  tested  in  the  whole  specified  range.  For  example,  to  test 
for  the  short  S5gd,  1,2  and  3  are  required.  3*,  ork*,  indicate 
that  3*  testing  procedure  have  to  be  applied  in  addition  to 
the  listed  vectors.  3*  testing  will  be  dealt  with  at  the  end  of 
this  section. 

In  order  to  find  the  minimal  test  set(s)/sequences,  we  have 
applied  the  concept  of  fault  coverage  table.  For  example, 
the  sets  obtained  for  k1  [x]3  are: 


Fault 

Tests 

Fault 

Tbsts 

Fault 

Tests 

Olg 

(0,  1)(0,  2) 
(0,3)  (1.2) 
(1.3X2, 1) 

(2, 3)  (3,1) 
(3,2) 

02g 

(0,  1)(0,  2) 

(0,  3)  (1.0) 

(2,0)  (3.0) 

03g 

correct 

04s  &  Q4d 

(1,0)  (2,0) 
(3.0) 

04g 

correct 

OSg 

correct 

07g 

(1,0)  (2,0) 

(3, 0)  (0, 1) 

(0, 2)  (0, 3) 

08s  &  08d 

(1.0)  (2,0) 

(3.0) 

08g 

(0,  1X0,  2) 

(0,  3)0,0) 

(2,  0)  (3,  0) 

09g 

(0.  1)(0,  2) 
(0,3) 

OlOs  &  OlOd 

(0,  1)  (0,  2) 

(0,  3) 

OlOg 

correct 

Oils  &  Olid 

(0.  1)(0,  2) 
(0.3) 

Ollg 

(0,1)  (0,  2) 

(0.3) 

012s  &  012d 

(1,0)  (2,  0) 

(3.  0) 

012g 

(0,  1)(0,  2) 

(0,  3)  (1,0) 

(2, 0)  (3, 0) 

013s  &  013d 

correct 

013g 

correct 

06g(k*) 

(0,1)  (0,2) 
(0,3) 

5ao 

1,2,3 

SAk 

0 

aSAO 

0 

bSA3 

correct 

S5gd 

ParaJcat: 
all  1,  2&  3 

S13gd&  SlOsd 

Parade  at: 

S6sd 

k*  at  1,2  or  3 

Table  4.  fc'[x]3  Literal  Tests. 


{(0,  3),  (1,  0),  2}  or  {(1,  0),  (0,  2),  3}  or  {(2, 0),  (0,  1),  3} 
or  {(2,  0),  (0,  3),  1}  or  {(3,  0),  (0,  1),  2}  or  {(3,  0),  (0,  2), 
1} 

Using  these  sets,  the  minimal  test  sequence(s)  can  be 
found  as  shown  in  Table  5.  Listed  below  (Table  5),  are  the 
minimal  test  sequences  for  the  6  literal  instances. 


Literal 

Test  Sequence 

fc'W1 

1  _  o  —  l  —  2  -+  1  or 

1_0—  1-+3-*  1 

w 

l  — *  0  — ►  1  — ►  3  — *  1  or  2  — ►  0  — ►  2  — *  3  — ►  2 

l_+0_ ►3_+2orI— i-O— *>2— *-3or2— *0— *  l—Oor 

2  — ♦  0  — *■  3  — ►  1  or  3  — ►  0  — *  l-+2or3— *-0— +2— I 

2  — ►  3  — ►  2  — ►  0  — ►  2  or  2  — ►  3  — ►  2  — ►  1  — ►  2 

3  — ►  1  —  2or  2  — +  1  -►  3 

2  — ►  3  — •-  2 

Table  5.  Minimal  test  sets/sequences  for 
six  literals. 

Similar  steps  were  followed  to  obtain  the  minimal  test 
sets  for  the  other  MVL  circuits.  Table  6  shows  those  for  the 
cycle.  Due  to  space  limitations,  the  test  sequences  for  the 
tsum  and  the  min  are  omitted. 


Cyde 

Test  sequence 

- --j— 

X 

2  — ►  0  — ►  3 

- - 

X 

1  3  _  0or  2  -+  0  —  3 

.3 

X 

2  — ►  3  — *  l  or  2  — *■  3  — ►  0 

Table  6.  Minimal  Test  Sets/ Sequences  for 
the  cycle. 


5.1  Peculiar  Faults  Testing 
Testing  for  3*  Faults 

The  3*  faults  refer  to  any  fault  that  results  in  current  levels 
higher  than  60uA.  Typical  values  found  in  the  simulations 
range  from  80uA  to  300uA. 

If  a  3*  value  is  produced  at  the  output  of  a  faulty  circuit,  it 
may  act  as  an  input  to  another  circuit.  If  the  fault  free  value 
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for  a  3*  fault  is  0,  1  or  2,  then  the  fault  will  be  detected. 
However,  when  the  fault  free  value  is  3,  then  the  behavior 
of  each  circuit  needs  to  be  studied  to  find  out  whether  this 
fault  will  produce  an  observable  effect  on  the  output  or  not. 
The  results  of  this  study  are  summarized  below. 

1.  Literal,  complement  of  literal  and  tSum  gates  will 
read  the  3*  value  as  3  and  their  outputs  will  not  be 
affected.  This  is  due  to  the  current  threshold  elements 
at  the  inputs  of  these  circuits  which  read  any  value  > 
60uA  as  3. 

2.  The  cycle  will  always  produce  different  outputs  for  3 
and  3*  inputs.  Therefore,  the  fault  can  be  detected. 

3.  The  min  will  respond  in  two  different  ways.  First,  if 
one  input  is  3*  and  the  other  input  /  0,  it  will  read  3* 
as  if  it  was  normal  3.  On  the  other  hand,  if  the  input 
was  <  3*, 0  >  (or  <  0, 3*  >)  the  output  will  be  a 
recognized  logic  level  higher  than  1 . 

In  the  previously  generated  test  sequences,  the  following 
should  be  note: 

1 .  If  a  3  *  fault  is  detectable,  the  sequence  will  generate  a 
test  for  it.  For  example,  SA3*  fault  in  the  cycle  can  be 
detected  by  tests  that  produce  a  different  output  from 
3. 

2.  If  the  3*  fault  can  not  be  detected,  the  test  is  designed 
to  excite  the  fault.  For  example,  the  sum*  fault  cat¬ 
egory,  in  the  tsum,  produce  un-truncated  sum.  For 
inputs  that  have  a  sum  <  3,  the  circuit  behaves  like 
the  fault- free  circuit.  For  higher  inputs,  the  3*  output 
will  be  produced.  So,  the  tests  are  designed  to  excite 
the  3*  output. 

The  circuits  that  produce  un-detectable  3*  faults  are:  lit¬ 
eral,  complement  of  literal,  and  tSum.  For  the  literals,  if  the 
k  is  1  or  2  and  a  3*  output  is  produced,  it  will  be  detected 
since  it  will  produce  different  logic  level,  3.  For  literals 
when  k  is  set  to  3  and  the  tSum,  special  propagation  pro¬ 
cedure  needs  to  be  taken  when  the  correct  output  is  3  and 
the  faulty  output  is  3*  (this  happens  when  x  in  [a,  b]  for  the 
literal,  x  outside  [a,  b]  for  the  complement  of  literal,  and 
x\  +  X2  >  3  for  the  tSum).  The  recommended  procedure  is 
shown  below. 

1 .  If  the  subsequent  circuit,  whose  input  is  connected  to 
the  3  *,  is  literal,  complement  of  literal  or  tSum,  it  will 
not  be  affected  and  will  produce  correct  values.  In 
this  case,  the  error  will  be  redundant. 

2.  If  the  subsequent  circuit  is  a  cycle,  it  will  always 
produce  faulty  output  which  can  then  be  propagated 
to  an  observable  output. 


3.  If  the  subsequent  circuit  is  a  min,  then  the  test  algo¬ 
rithm  should  propagate  the  expected  3*  twice.  Once 
with  the  other  min’s  input  set  to  0  and  once  when  it 
is  set  to  3.  The  0  will  detect  a  3*  fault  and  the  3  will 
detect  other  faults. 

The  fault  coverage  figures  for  the  generated  test  se¬ 
quences  in  circuit  are  tabulated  in  Table  7.  Two  figures 
are  given.  The  first  assumes  that  the  above  procedure  of  3* 
testing  is  applicable.  The  second  is  when  the  3*  procedure 
is  not  applied. 


nsn 

#  of  un-deteetable 

3*  faults 

Coverage,  3* 
test  not  applicable 

Literal 

S6gd  <Osc.) 

98.7% 

98.7 -(2.6  *  a) 
98.7%  if  a  =  0 
96.1%  if  o  ~  1 

Corap.  of 
literal 

S6gd  (Osc.) 

98.7% 

98.7 -(2.6*  a) 

98.7%  if  =  0 
96.1%  if  a  =s  1 

HSE9MI 

0 

0 

100% 

HI 

96.2% 

75.4% 

Min 

• 

100% 

0 

100% 

Table  7.  Fault  coverage  figures  and  3*  test 
ing. 


o,  in  Table  7  is  the  ratio  of  the  number  of  lit¬ 
eral/complement  of  literal  gates  that  are  designed  with  k 
=  3  compared  to  the  number  of  literal/complement  of  literal 
gates  in  the  whole  network. 

Oscillations  Fault 

Under  this  fault,  the  circuit’s  output  oscillates  over  the  whole 
range  from  0  to  60u  A  with  high  frequency.  It  is  caused  by  the 
a  continuous  charging/discharging  paths  affecting  the  gate 
voltage  (N rej)  of  transistors  4,  5  and  6.  The  charging  path 
sources  from  the  output  current  (when  the  switch  transistor 
7  is  open)  through  the  short  to  the  gate  of  transistor  6  (which 
is  connected  to  the  gates  of  4  and  5).  The  discharging  path 
(when  transistor  7  is  closed)  passes  through  the  short  to  the 
grounded  source  of  transistor  6.  This  fault  is  not  covered 
by  the  test  sequences  derived  earlier.  However,  only  one 
short  produced  this  type  of  fault  (S6gd  in  the  literal  and 
complement  of  literal,  and  S3gd  in  the  tSum)  which  is  a  low 
percentage  (1.3%  for  the  literals  and  1.9%  for  the  tSum). 

This  can  be  solved  for  the  Literal  and  Complement  of 
Literal  if  the  positions  of  transistors  6  and  7  (Figure  2)  are 
swapped  (a  Design  for  Testability  (DFT)  issue).  In  this 
case,  the  fault  will  be  converted  to  a  normal  fault  that  can 
be  detected  by  the  test  sequences.  This  was  not  possible  for 
the  tSum. 

A  Soft  Error 

One  short,  S6gd  in  the  tSum,  resulted  in  a  slightly  different 
output  from  the  fault  free  circuit.  It  only  changes  the  cur¬ 
rent  value  of  logic  3  (normally  60uA)  into  an  intermediate 
level  (50uA)  between  2  and  3.  This  can  be  read  as  3,  and 
passes  undetected,  or  as  2  which  can  be  detected.  This  can 
not  be  determined  and  is  considered  uncovered  by  the  test 
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sequences  produced. 

6  Recommendations  for  Testability  of  MVL 
Circuits 

Based  on  the  results  obtained  in  this  work,  the  following 
general  recommendations  can  be  made  for  the  set  of  MVL 
circuits  considered: 

1 .  Literals  and  tSum  should  not  be  used  as  output  stages. 
This  is  to  avoid  3*  values  at  the  output  where  its  effect 
can  propagate  to  another  circuit.  Usually,  literals  ap¬ 
pear  at  the  input  stages  and  do  not  appear  afterwards. 

2.  If  a  tSum  has  to  be  used  at  the  output,  it  should  be 
followed  by  2  consecutive  cycle  gates.  This  will  guar¬ 
antee  the  detection  of  the  3*  faults  if  they  appear  at 
the  tSum  output. 

3.  Apply  3*  test  procedure  for  every  min  preceded  by 
literals  or  tSum.  This  is  to  properly  propagate  possible 
3*  values  generated  by  the  literals  or  the  tSum. 

4.  For  the  Literal  and  Complement  of  Literal  gates,  swap 
the  positions  of  transistors  6  and  7.  This  will  eliminate 
the  oscillation  fault  caused  by  the  S6gd  short  and  will 
convert  it  to  a  testable  fault. 

7  Concluding  Remarks  and  Future  Work 

In  this  paper,  fault  characterization  and  testability  con¬ 
siderations  in  a  set  of  MVL  basic  gates  were  considered.  To 
achieve  this,  faults  (shorts  and  opens)  were  inserted  one  at 
a  time  and  each  circuit  was  simulated  for  all  possible  input 
transitions  to  find  out  which  inputs  are  valid  to  test  for  the 
fault  inserted.  The  process  was  repeated  for  each  circuit  for 
all  its  faults  and  the  resultant  test  vectors  for  all  faults  were 
tabulated.  Then  these  tables  were  used  to  generate  minimal 
test  sequences  for  each  circuit. 

The  main  conclusions  that  can  be  derived  from  this  work 
are: 

1 .  The  stuck-at  fault  model  is  inadequate  for  representing 
faults  in  the  MVL  circuits.  Actually,  it  represented 
less  than  40%  of  the  total  faults. 

2.  High  percentage  of  faults  appeared  as  functional 
faults,  which  change  the  circuit  output  into  differ¬ 
ent  function.  They  ranged  from  20%  in  the  literals  to 
more  than  50%  in  the  cycle. 

3 .  Only  S  AO  or  S  A3  faults  appeared.  There  are  no  stuck- 
at  faults  at  the  other  logic  levels  1  or  2  (stuck-at  the 
literal  value,  in  the  literals,  is  a  special  case  of  SA3). 


After  using  the  testability  recommendations  in  this  work, 
the  given  test  sequences  can  be  used  in  testing  larger  cir¬ 
cuits.  The  authors  are  currently  working  on  the  issue  of  pre¬ 
computed  test  sets  for  circuits  using  the  considered  MVL 
set  as  basic  building  blocks. 
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Abstract 

In  this  paper  we  show  how  Boolean  Ring  logic,  a  group- 
based  logic,  leads  to  a  circuit  implementation  that  is  highly 
testable.  We  develop  Boolean  Ring  based  expressions,  which 
we  call  Generalized-Literal  Boolean-Ring  Sum-of-  Products 
(GL-BRSOP)  and  Universal-Literal  Boolean-Ring  Sum-of- 
Products  (UL-BRSOP)  to  represent  (powers  of  2)-valued 
MVL  functions.  Our  BRSOPs:  allow  MVL  functions  to  be 
implemented  with  vectors  of  binary  AND  and  EXOR  gates; 
allow  the  use  of  a  binary  logic fault  model;  and  allow  a  time- 
multiplexed  implementation  that  is  highly  testable  and  uses 
less  redundant  circuitry. 

1.  INTRODUCTION 

Boolean  Ring  is  a  group-based  logic  system  like  Galois 
Fields.  A  Galois  Field,  GF(kr),  is  a  set  where  k  is  a  prime 
number  and  r  is  a  positive  integer  [3, 13].  AND-EXOR  logic 
systems  correspond  to  the  2-valued  Galois  Field,  GF(2).  It 
is  well  known  that  binary  AND-EXOR  circuits  are  easily 
testable.  Researchers  have  introduced  highly  testable  imple¬ 
mentations  and  test  sets  for  various  AND-EXOR  forms  such 
as  PPRM,  FPRM,  GRM,  and  ESOP  [16,  17,  19,  20].  The 
testability  of  multiple- valued  Galois  Field  logic  circuits  has 
also  been  investigated.  Dubrova  investigated  GF(A-)  circuits 
[6],  where  k  is  a  prime  number;  and  in  [1 1],  we  proposed 
a  highly  testable  realization  and  a  testing  scheme  for  the 
most  general  functional  representation,  Galois  Field  Sum- 
of-  Products  (GFSOP),  over  GF(fcr),  where  r  is  equal  to  or 
greater  than  1. 


Figure  1.  GF(4)  addition  and  multiplication  tables. 


The  high  testability  of  GF  circuits  is  due  to  the  fact  that 
the  GF  operators  exhibit  the  cyclic  group  property.  This 
property  can  be  explained  from  the  4-  valued  (quaternary) 
GF  operators  as  shown  in  Figure  1.  Note  that  in  any  row  of 
the  addition  table  in  Figure  1,  the  elements  are  all  different, 


which  is  cyclic,  and  that  the  elements  have  a  different  order 
in  each  row.  Another  cyclic  group  can  be  observed  in  the 
multiplication  table.  If  the  zero  element  (shaded  in  gray)  is 
removed  from  the  multiplication  table,  then  the  remaining 
elements  form  a  cyclic  group.  In  binary,  the  GF(2)-addition 
operator,  EXOR,  has  the  cyclic  group  property.  This  char¬ 
acteristic  is  useful  in  detecting  faults  because  any  single 
change  in  the  inputs  of  the  gate  will  cause  a  change  in  the 
output. 

Our  work  with  GF (kr)  circuits  led  us  to  investigate  other 
logic  systems  that  exhibit  the  cyclic  group  property.  One 
such  logic  system  is  Min-ModSum,  which  was  invented  by 
Dueck  and  Miller  [7],  The  operators  of  Min-ModSum,  Min 
and  ModSum,  perform  the  Minimum  operation  and  Modulo- 
Summation,  respectively.  The  testability  of  Min-ModSum 
circuits  is  similar  to  that  of  multiple-valued  GF  circuits  be¬ 
cause  ModSum  forms  a  cyclic  group  with  all  the  elements  of 
the  logic.  However,  Min  does  not  have  any  cyclic  group,  so 
the  test  set  is  not  superior  to  that  of  GFSOPs  given  in  [1 1]. 
Remember  that  GF-multiplication  has  a  cyclic  group  when 
the  zero  element  is  excluded. 


Group-based  Logic  Families 


Galois  Fields  Min-ModSum  Logic  Boolean  Rings 
[7]  [5,14*21] 


GF(2)  GF(£)  GF(/0 

Binary  AND-EXOR  [6]  [11] 

[16-20] 


Figure  2.  The  group-based  logic  families. 


The  contribution  of  this  paper  is  the  analysis  of  Boolean 
Ring  [5,  14,  21],  BR(2r),  as  another  group-based  multiple¬ 
valued  logic  with  2r  elements,  where  r  is  a  positive  inte¬ 
ger  number.  Figure  2  summarizes  the  currently  investigated 
group-based  logic  systems.  One  major  advantage  of  BR  is 
that  it  allows  MVL  functions  to  be  implemented  in  space 
with  vectors  of  binary  AND  and  EXOR  gates.  As  we  show 
later,  when  implemented  in  time,  BR  also  leads  to  highly 
testable  circuits  with  efficient  test  sets.  This  fact  was  not  re¬ 
alized  by  previous  researchers.  The  implementation  in  time 
is  realized  by  using  registers  and  by  time-multiplexing  the 
circuit.  Our  work  has  significant  practical  implications  be¬ 
cause  BR  circuits  can  be  implemented  with  very  fast  opti- 
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cal  realizations  [9].  Also,  some  recent  patents  emphasize  a 
time-multiplexed  logic  that  can  be  used  to  implement  BR 
functions  realized  in  FPGAs  [12,  22]. 

The  organization  of  this  paper  is  as  follows.  In  Section  2, 
we  introduce  BR  functional  representations  and  demonstrate 
the  functional  completeness  of  these  forms.  In  Section  3, 
we  show  highly  testable  implementations  of  these  forms  and 
give  an  efficient  testing  scheme.  In  Section  4,  we  give  our 
conclusions  and  plans  for  future  work. 

2.  BR(2r)  FUNCTIONAL  REPRESENTA¬ 
TIONS 

2.1.  Binary  Vector  Interpretation  of  Boolean  Ring 

In  this  section,  we  will  define  a  new  interpretation  that 
uses  binary  gates  to  implement  arbitrary,  T -valued  func¬ 
tions.  In  this  paper,  we  assume  that  each  MVL  variable  is 
represented  by  r  binary  wires,  or  rather,  an  ordered  binary 
vector  with  r  elements.  This  scheme  allows  us  to  imple¬ 
ment  the  circuit  with  vectors  of  binary  gates  and  to  simply 
increase  the  size  of  the  vector  linearly  as  the  radix  is  in¬ 
creased.  As  an  example,  Table  1  shows  the  representation 
(encoding)  for  quaternary  logic,  the  case  where  r  =  2. 


00 

If)  I 

01 

1 

10 

2 

11 

3 

Table  1.  The  binary  encoding  for  quaternary  logic  levels. 

Most  multiple-valued  systems  have  two  operators: 
addition-like  and  multiplication-like.  The  multiplication¬ 
like  operator  is  used  to  cut  a  portions)  of  the  entire  func¬ 
tion,  and  the  addition-like  operator  is  used  to  combine  these 
sub-functions  to  create  the  complete  function.  Therefore,  in 
any  complete  logic  system,  a  function  can  be  represented 
in  Sum-of-Products  form,  or  rather,  the  combination  of  sub¬ 
functions.  In  GF,  for  example,  the  GF-multiplication  imple¬ 
ments  the  cutting  function,  and  the  GF-addition  implements 
the  combining  function.  (Hozumi,  et  al.,  have  shown  the  ex¬ 
istence  of  other  choices  to  implement  the  cutting  and  com¬ 
bining  functions  in  [10].)  Since  our  objective  was  to  have  a 
highly  testable  implementation,  we  felt  it  very  important  to 
select  operators  that  have  some  form  of  a  cyclic  group. 

It  can  be  shown  from  the  axioms  of  Boolean  Ring  that 
a  vector  of  r,  binary  EXOR  gates  can  be  used  to  imple¬ 
ment  the  BR-addition  operation  with  our  vector  approach 
and  the  encoding  in  Table  1.  As  explained  earlier,  the  binary 
EXOR  gate  exhibits  the  cyclic  group  property  and  thus  leads 
to  easily  testable  circuit  implementations.  Figure  3  shows 
the  vector  of  EXOR  gates  for  a  quaternary  logic  function 
with  multiple- valued  input  variables  A  and  B.  Note  that  each 
input  variable  and  the  output  are  represented  by  two  wires. 


For  MVL  with  2r_3  =  8  values,  three  EXOR  gates  would  be 
required  for  the  vector. 


Figure  3.  The  vector  of  EXORs  that  implements  the  BR- 
addition  in  quaternary  logic. 

Definition  1:  ”®0”  represents  the  function  realized  by  a 
vector  of  EXORs,  which  we  will  call  the  BR-addition  op¬ 
erator.  (The  two  EXOR  symbols  imply  the  vector,  not  the 
number  of  EXOR  gates  used  in  the  realization  of  the  opera¬ 
tor.) 

Example:  BR-addition  of  two  quaternary  constants: 
2003  =  [I0]bt'norj/®®[1  l]w nary  ~  [Oljbjnary  —  !• 

Figure  4  shows  the  truth  table  for  the  vector  of  EXORs, 
both  in  binary  and  MVL  form.  By  comparison  with  Figure 
1,  it  can  be  seen  that  this  circuit  implements  the  4- valued 
GF-addition. 
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Figure  4.  The  truth  table  for  the  vector  of  EXORs  and  the 
corresponding  quaternary  table. 

To  implement  the  cutting  operation,  let’s  investigate  the 
properties  of  a  vector  of  ANDs  such  as  that  shown  in  Figure 
5.  The  truth  table  in  Figure  6  shows  that  our  vector  AND 
circuit  does  not  implement  the  GF-multiplication  operator. 
Furthermore,  it  does  not  exhibit  the  cyclic  group  property. 
However,  similar  to  the  Min  operator  in  Min-ModSum  logic, 
it  has  the  0  (zero)  element  and  the  1  (multiplicative-identity) 
element  of  a  logic  system.  In  quaternary,  these  are  ‘0’  for 
the  0  element,  and  ‘3’  for  the  1  element. 

Definition  2:  ’’represents  the  function  realized  by  a 

vector  of  ANDs,  which  we  will  call  the  BR-multiplication 
operator. 

Example:  BR  multiplication  of  two  quaternary  constants: 
2**3  =  [  1 0]t;>!ar-y**[l  1  [binary  =  [  1 0]f>inary  —  2. 
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Figure  6.  The  truth  table  for  the  vector  of  ANDs  and  the 
corresponding  quaternary  table. 


2.2.  Boolean  Ring  Sum-of-Products  Forms 

Definition  3:  A  literal  is  a  one-variable  function  that  has 
the  minimum  or  the  maximum  value  of  the  logic  system  as 
its  value.  It  has  the  maximum  value  for  the  input  values 
specified  as  the  left  superscript,  and  the  minimum  value  for 
the  remaining  input  values. 

Definition  4:  If  a  literal  is  in  a  simple  form,  and  has  only 
one  input  value  specified  in  its  left  superscript,  then  the  lit¬ 
eral  is  called  a  Post  literal  or  the  characteristic  function  of 
a  variable.  For  example,  in  ternary  logic,  a  Post  literal  °x 
takes  the  value  ‘2’  for  x  =  0,  and  ‘O’  for  the  remaining  input 
values  (‘  1  ’  and  ‘2’). 

Definition  5:  If  a  literal  has  more  than  one  input  value  in 
its  left  superscript,  then  the  literal  is  called  a  generalized  lit¬ 
eral.  For  example,  in  quaternary  logic,  a  generalized  literal, 
1'2x  takes  the  value  ‘3’  for  x  =  1  and  x  =2,  and  takes  the 
value  ‘O’  for  the  other  input  values,  x  =  0  and  x  =  3.  Note 
that  the  Post  literal  is  a  special  case  of  the  generalized  literal. 

Definition  6:  A  more  complex  literal,  universal  literal, 
is  defined  by  Fraser  and  Dueck  in  [8]  as  follows.  Let  Xi 
be  a  variable  that  can  assume  any  of  the  logic  values  in  the 
set  {0,  1,2,.. .,  (AM)},  where  N  is  the  radix.  Let  X  = 
{zi,  X2,  ■  ■  ■,  x„)  be  a  set  of  n  variables.  A  universal  lit¬ 
eral  is  a  one-variable  function  <a0“i  •  ■  where 

<a0ai . .  ,a^f-i>Xi  =  aXi.  For  example,  in  quaternary 
logic,  a  universal  literal  <  1003  >  A  of  a  quaternary  vari¬ 
able  A  takes  the  value  ‘  1  ’  for  A  =  0;  takes  the  value  ‘O’  for 
A  =  1  and  A  =  2;  and  takes  the  value  ‘3  ’  for  A  =  3  according 
to  the  definition. 

The  relation  of  complexity  among  the  different  types  of 
literals  in  our  context  can  be  illustrated  as  in  Figure  7.  The 
universal  literals  have  the  most  complexity  among  the  three. 

Assuming  a:  is  an  MVL  variable,  the  following  axioms 
can  be  observed  for  the  BR  multiplication  operator: 

1.  x  ••  0  =  0  =>  x  —  0  =  0  (in  quaternary) 

2.  x  —  l-x  =>  x  —  3  =  x  (in quaternary) 


Universal  Literals 

*1 Generalized  Literals' " 
(Tost  Literals^ 


Figure  7.  The  inclusion  of  the  complexity  of  the  literals. 


These  properties  allow  us  to  construct  our  sub-functions 
(product  terms)  based  on  literals,  rather  than  variables  or 
powers  of  variables  as  in  GF  product  terms.  The  practical 
result  is  that  BR  logic  functions  can  be  expressed  with  just 
constants,  literals  and  BR  vector  operators. 

Definition  7:  A  Boolean  Ring  form,  Generalized-Literal 
Boolean-Ring  Sum-of-Products  (GL-BRSOP),  can  be  de¬ 
fined  as: 

/  =  (Ci—Xn—Xi2—  •  *  *  ••Ein)®© (C2—X2I—X22—  ■  *  • 
••Z2n)®®  '  *  •Q®(Cp—Xpi—Xp2—  *  '  •  —Xpn), 

where  it  is  assumed  that  there  are  n  variables  and  p  prod¬ 
uct  terms;  c,-  is  a  constant  such  that  1  <  Cj  <(2r-l);  Xij 
is  a  generalized  literal  of  the  jth  variable  in  the  ith  prod¬ 
uct  term,  and  it  can  be  a  different  literal  of  the  same 
variable  at  each  product  term.  For  a  quaternary  vari¬ 
able,  ,  can  be  °x,  0,1x,  2i3x,  etc.  The  function,  /  - 
(2##ixi«#o'3x2)e0(l^1,2,%i^2)00(2^<:),2xi«*3x2),  is 
an  example  of  a  GL-BRSOP  expression.  Notice  that  the  ex¬ 
pression  can  contain  both  Post  literals  and  generalized  liter¬ 
als.  If  only  Post  type  literals  are  used  in  the  product  terms 
(minterms),  the  GL-  BRSOP  expression  is  a  canonical  form 
analogous  to  the  binary  Minterm  Canonical  Form.  Other¬ 
wise,  the  expression  is  non-canonical  analogous  to  the  bi¬ 
nary  SOP  expression.  Our  next  step  is  to  prove  the  general¬ 
ity  of  our  formulation. 

Theorem  1:  Every  multiple- valued  logic  function  with  2r 
logic  values  can  be  realized  in  Generalized-Literal  Boolean- 
Ring  Sum-of-Products  (GL-BRSOP)  form. 

Proof:  Like  binary  SOP  expressions,  a  GL-BRSOP  ex¬ 
pression  can  be  written  by  BR-adding  the  minterms  that  are 
constructed  for  each  entry  in  the  truth  table  where  the  func¬ 
tion  is  not  zero.  In  such  case,  the  BRSOP  expression  is  non 
minimal,  and  based  on  Post  literals.  A  minterms  is  con¬ 
structed  such  that  it  gets  the  value  of  the  function  in  the  cor¬ 
responding  entry.  The  minterm  construction  is  done  by  BR- 
multiplying  a  constant  and  all  the  Post  literals  constructed 
for  the  entry.  Since  each  Post  literal  in  the  minterm  gets  the 
value  of  the  1  element  for  the  entry,  the  multiplication  of 
them  gives  the  1  element.  Therefore,  the  constant  is  chosen 
such  that  the  minterm  gets  the  value  of  the  function  for  that 
entry.  As  a  result,  the  BR-addition  of  all  the  minterms  con¬ 
structed  in  this  manner  realizes  the  entire  function  because 
the  expression  gets  the  value  of  the  minterm  for  the  corre¬ 
sponding  entry  where  all  other  minterms  get  the  value  of  the 
zero  element.  a 

Example:  We  will  use  the  quaternary  MVL  function 
whose  truth  table  is  shown  in  Table  2  to  illustrate  the  con¬ 
struction  of  a  GL-BRSOP  expression. 

For  each  entry  that  does  not  have  a  ‘O’  output,  we  write 
a  Post  literal  based  minterm  using  BR-multiplication  and 
then  combine  these  minterms  using  BR-addition  to  gener¬ 
ate  the  BRSOP  expression.  For  the  first  entry  in  the  truth 
table,  the  minterm  is  Mi  =  (2##°A##15).  Without  the  con¬ 
stant  coefficient,  the  minterm  value  is  the  1  element  (=3) 
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Table  2.  A  quaternary  MVL  function. 


for  the  input  values  in  the  first  entry  because  each  literal 
in  the  minterm  gets  the  value  of  the  1  element  ( °A—  lB  = 
3~3  =  3).  However,  we  want  the  value  of  the  minterm 
to  be  ‘2’  to  implement  the  example  function  for  this  en¬ 
try.  Therefore,  the  multiplication  with  constant  ‘2’  gives 
the  correct  output  (2~3  =  2).  Obviously,  this  correction 
is  not  necessary  for  the  minterms  where  the  desired  result 
is  ‘3’  (the  1  element).  After  obtaining  all  the  minterms  in 
the  same  manner,  the  GL-BRSOP  expression  that  represents 
the  entire  function  is:  F  =  Mi0®M2®0M300M4  = 
(2—  °A—  1B)0©(  2—  1A~2B)®®(  94~1£)ee(l#*  3a—3b). 

Definition  8:  A  Boolean  Ring  non-canonical  ex¬ 
pression,  Universal-Literal  Boolean-Ring  Sum-of-Products 
(UL-BRSOP),  can  be  defined  as: 

/  =  (Ci—Xn—Xi2—  ‘  *  •  ••2ln)®0(c2~Z21#*£22#*  •  •  • 
~Z2n)®0  *  ‘  -m>(Cp—Xpi—Xp2~  ’  •  —Xpn), 

where  it  is  assumed  that  there  are  n  variables  and  p 
product  terms;  c*  is  a  constant  such  that  1  <  c*  <(2r- 
1);  Xij  is  a  universal  literal  of  the  jth  variable  in  the 
ith  product  term,  and  it  can  be  a  different  literal  of  the 
same  variable  at  each  product  term.  For  example  /  = 
(<2100>xi~  <0331>z2)ee(<0330>zi~  <1123>z2)  is 
a  quaternary  UL-BRSOP  expression.  Notice  that  in  contrast 
to  GL-BRSOP,  the  expression  does  not  contain  constant  co¬ 
efficients  because  a  universal  literal  can  have  any  value. 

UL-BRSOP  expressions  are  also  functionally  complete 
since  they  include  GL-BRSOP  expressions.  We  will  give 
the  following  example  to  illustrate  how  universal  literals  can 
be  used  to  express  an  MVL  function  in  BR. 

Example:  Let’s  find  a  UL-BRSOP  expression  for  the  2- 
variable  quaternary  function  below: 
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Table  3.  A  quaternary  function. 

Since  UL-BRSOP  is  not  a  canonical  form,  the  solu¬ 
tion  we  give  below  is  just  one  of  the  possible  solutions  to 


the  problem.  Searching  for  the  minimum  solution  to  the 
problem  could  be  done  with  a  heuristic  scheme  similar  to 
that  described  in  [8],  or  the  approximate/exact  backtrack¬ 
ing  minimization  scheme  that  was  proposed  in  [23]  and 
extended  in  [15].  Other  algorithms  such  as  Greedy,  Ge¬ 
netic,  Tabu-Search,  and  Simulated  Annealing  could  also 
be  adapted.  Our  solution  for  the  function  is:  F  — 
(<0233>A~<0013>£)ee(<0330>A~<0330>£).  Note 
that,  unlike  a  GL-BRSOP,  the  solution  does  not  contain  a 
minterm  for  each  entry  in  Table  3.  The  easiest  way  to  verify 
this  solution  is  by  looking  at  the  truth  tables  for  the  univer¬ 
sal  literals  of  the  expression  and  performing  the  indicated 
BR  vector  operations  on  a  bit-by-bit  basis  for  each  entry  as 
shown  in  Figure  8.  For  example,  if ‘2’  ([10]{>inan/)  and  ‘3’ 
([1  ^binary)  are  vector- ANDed  (••)  together,  then  the  result 
is  ‘2  ([10]t>inary). 


Figure  8.  The  functional  verification  of  the  example  UL- 
BRSOP  expression. 


3.  EASILY  TESTABLE  IMPLEMENTA¬ 
TIONS 

3.1.  The  BRSOP  Implementations  in  Space 

To  illustrate,  we  will  show  how  our  earlier  GL-BRSOP 
example,  F  =  {2-°A-lB)m)(2-1A-2B)m{3A-1B) 
®©(1.*94.*3B)  can  be  implemented  with  binary  gates.  Fig- 
ure  9  shows  a  high  level  MVL  schematic  of  the  implementa¬ 
tion.  The  BR  operators,  the  vector  of  ANDs  and  the  vector 
of  EXORs,  are  symbolized  using  a  black  bar  inside  the  con¬ 
ventional  AND  and  EXOR  symbols.  The  vector  network  can 
be  implemented  either  as  a  tree,  or  as  a  cascade  as  shown  in 
Figure  9.  Our  purpose  in  choosing  the  cascade  architecture 
will  be  explained  in  Section  3.3. 

The  binary  implementations  for  required  literals  can  be 
found  directly  from  their  definitions.  The  truth  table  and  cir¬ 
cuit  in  Figure  1 0,  for  example,  show  how  to  obtain  the  literal 
°A  for  quaternary  variable  A.  If  only  Post  literals  are  used  in 
the  implementation,  the  literals  can  always  be  implemented 
using  a  single  AND  gate  with  NOT.  The  reason  is  that  there 
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Figure  9.  The  MVL  schematic  tor  the  BR  logic  implementa¬ 
tion  in  space. 


is  only  one  input  combination  that  would  make  the  Post  lit¬ 
eral  have  the  value  of  the  1  element.  Using  the  more  com¬ 
plex  generalized  literals  to  obtain  the  function  may  produce 
fewer  product  terms  in  the  GL-BRSOP  expression,  but  the 
binary  implementation  of  these  complex  literals  may  require 
multiple  gates,  such  as  AND,  OR,  EXOR,  XNOR,  and  NOT; 
with  multiple  levels.  For  instance,  an  8-valued  literal  lj6x 
has  the  value  of  the  1  element  for  two  input  combinations: 
[001]6inary  and  [1  \0]binary ,  which  requires  two  AND  gates 
in  the  first  level  and  one  OR  gate  in  the  second  level  of  the 
binary  implementation  of  this  literal. 
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Figure  10.  The  binary  implementation  of  a  Post  type  literal. 


The  first  minterm  (Mi)  of  the  example  function  can  be 
implemented  as  shown  in  Figure  11a.  The  minterms  pro¬ 
duced  in  this  manner  are  then  combined  with  a  vector  of 
binary  EXOR  gates.  Note  that,  as  is  often  the  case,  multi¬ 
plication  by  a  constant  simplifies  final  minterm  implemen¬ 
tation.  This  is  illustrated  in  Figure  1  lb  where  the  literals  are 
multiplied  by  the  constant  ‘2\ 


Figure  11.  A  product  term  realization  before  and  after  con¬ 
stant  multiplication. 


Figure  12  shows  our  complete  binary  implementation  for 
the  example  function,  F  (f\  and  /o).  Notice  the  repetition 
(r  times,  log2  [radix])  in  the  binary  AND-EXOR  network 
for  each  bit  of  a  variable.  This  property  simplifies  routing 
because  r-tuples  of  wires  representing  the  same  variable  are 


always  routed  together,  thus  the  number  of  connections  ac¬ 
tually  routed  is  minimized.  Furthermore,  the  circuit  can  be 
simplified  for  constant  inputs  before  realization  as  shown  in 
Figure  1  lb. 


Vector  Network 


Figure  12.  The  quaternary  example  function  implemented 
in  BR  logic. 


As  mentioned  earlier,  the  variable-to-literal  conversion 
network  requires  only  single  gates,  if  Post  type  literals  are 
used.  However,  for  complex  functions,  the  number  of  gates 
required  to  implement  the  function  using  such  literals  may 
become  excessive  since  this  kind  of  circuit  requires  creating 
a  minterm  for  each  entry  in  the  truth  table.  This  problem 
can  be  overcome  by  using  more  complex  generalized  liter¬ 
als.  More  complex  generalized  literals  reduce  the  number 
of  product  terms  at  the  cost  of  increased  complexity  in  the 
Variable-to-Literal  conversion  circuitry,  but  in  most  cases, 
it  leads  to  a  general  reduction,  especially  for  large  circuits. 
Therefore,  it  is  reasonable  to  assume  synthesis  with  the  most 
complex  literals  possible,  i.e.  universal  literals. 

Universal  literals  can  be  realized  with  binary  gates  in  the 
same  manner  as  the  generalized  literals  explained  earlier. 
Figure  13  shows  the  implementation  of  the  universal  literal 
<0221>A  Note  that  the  realization  requires  a  more  com¬ 
plex  realization  than  would  a  Post  literal.  For  higher  radix 
values,  universal  literal  realization  may  even  require  multi¬ 
ple  level  networks. 


Figure  13.  The  binary  implementation  for  a  quaternary  uni¬ 
versal  literal. 


As  a  complete  example.  Figure  14  shows  the  imple¬ 
mentation  for  our  earlier  UL-BRSOP  expression  F  = 
(<0233>A##<0013>5)ee(<0330>A~<0330>£). 
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Vari able  to  Literal  Vector  Network 


Figure  14.  The  UL-BRSOP  implementation  in  space. 


The  operators  of  BR  logic  are  somewhat  similar  to  those 
of  Min-ModSum  logic  [7],  but  in  contrast  to  all  other  MVL 
systems,  they  can  be  time-multiplexed  as  will  be  explained 
in  the  following  section. 

3.2.  The  BRSOP  Implementations  in  Time 

Unlike  most  other  multiple- valued  logic  systems,  our  in¬ 
terpretation  of  Boolean  Ring  logic  can  be  time-multiplexed 
to  improve  its  testability.  To  see  the  rationale  for  this,  first 
note  that  in  the  BRSOP  realizations  shown  in  Figure  12  and 
14,  there  is  a  repetition  of  r  times  in  the  vector  network  sec¬ 
tion.  The  vector  network  essentially  implements  multiple 
Positive  Polarity  Reed  Muller  (PPRM)  networks.  Also,  note 
that  these  networks  function  in  parallel,  independently  from 
each  other.  Therefore,  as  we  will  illustrate  here,  a  single 
PPRM  network  can  be  time-  multiplexed  with  the  aid  of  shift 
registers,  which  also  function  as  scan  registers  during  test. 
Other  time-multiplexing  schemes  [2, 12, 22]  can  also  be  uti¬ 
lized  for  this  purpose.  Figure  15  shows  the  time-multiplexed 
version  of  the  circuit  in  Figure  12.  Since  the  PPRM  network 
is  time  shared,  there  is  only  one  PPRM  network  instead  of  r 
PPRM  networks  as  in  the  vector  network. 


Scan-In 


Figure  15.  The  binary  schematic  for  the  time-multiplexed 
version  of  the  GL-BRSOP  implementation. 


The  scan  registers  have  two  distinct  modes  of  operation: 
normal  mode  and  test  mode.  During  the  normal  mode  of  op¬ 
eration,  the  literal  values  coming  from  the  conversion  part 
are  loaded  into  the  registers  in  parallel  on  the  first  clock 
pulse.  The  literal  bits  for  the  binary  function  f0  are  then 
applied  to  the  PPRM  network  and  the  result  propagates  to 
the  output  register.  On  the  second  clock  pulse,  the  result 
for  function  /0  is  serially  loaded  into  the  output  register  and 
the  literal  bits  for  function  fi  are  shifted  into  the  inputs  of 
the  PPRM  network.  On  the  third  clock  pulse,  the  result  for 
function  f\  is  serially  loaded  into  the  output  register.  At  this 
point  all  the  bits  for  the  MVL  function  F  (/i  and  /o)  can  be 
read  in  parallel  from  the  output  register. 

33.  The  Testing  Scheme 

During  the  test  mode,  all  scan  registers  act  as  a  single 
shift  register  connected  as  a  chain.  The  scan  registers  assist 
in  testing  because  they  isolate  the  conversion  network  from 
the  PPRM  network,  enabling  these  parts  to  be  tested  individ¬ 
ually.  Test  vectors  are  shifted  into  the  scan  chain  from  the 
Scan-In  input  to  apply  them  to  the  PPRM  network;  and  the 
results  are  collected  from  the  primary  circuit  output,  the  out¬ 
put  register.  The  test  vectors  are  applied  to  the  primary  cir¬ 
cuit  inputs  to  test  the  conversion  network,  and  the  results  are 
collected  by  the  scan  chain  and  shifted  out  from  the  Scan- 
Out  output. 

We  have  chosen  a  cascade  architecture  for  the  BR- 
addition  gates  in  the  vector  network  of  our  implementation 
as  shown  in  Figure  9.  As  a  result,  the  resulting  PPRM  net¬ 
work  in  Figure  15  can  be  tested  for  multiple  faults  with  a 
universal  test  set  described  by  Reddy  and  Saluja  in  [18]. 
The  c0  input  is  introduced  in  compliance  with  their  scheme. 
On  the  other  hand,  there  are  testing  schemes  for  tree  type 
implementations,  such  as  the  one  presented  in  [20].  How¬ 
ever,  such  schemes  may  introduce  extra  circuitry  and/or  ad¬ 
ditional  test  vectors  for  testability. 

The  PPRM  network  could  be  replaced,  for  example,  with 
an  arbitrary  AND-OR  network.  However,  in  this  case  the 
test  set  would  have  to  be  obtained  with  a  test  generation 
algorithm,  such  as  D-algorithm,  because  such  an  arbitrary 
network  is  not  as  easily  testable  as  a  PPRM. 

The  conversion  network  is  implemented  as  series  of  sin¬ 
gle  gates  if  Post  type  literals  are  used;  or  implemented  as 
series  of  binary  networks  if  more  complex  literals  are  used. 
Therefore,  for  multiple  faults,  all  of  the  gates  (or  networks) 
can  be  tested  exhaustively  in  parallel.  The  number  of  tests 
for  each  gate  (or  network)  in  the  conversion  network  then  is 
2r,  the  radix.  Since  the  gates  (networks)  are  tested  in  par¬ 
allel,  this  is  also  the  number  of  tests  required  for  the  entire 
conversion  network. 

Testing  the  scan  registers  is  also  done  separately.  This  is 
left  out  of  this  paper  since  we  are  using  conventional  scan 
registers  that  have  a  well-  known  testing  schemes  described 
by  other  researchers  [1,4]. 
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4.  CONCLUSIONS 

In  this  paper  we  showed  how  group-based  Boolean  Ring 
logic  system  leads  to  circuit  implementations  that  are  easily 
testable.  The  simplest  example  of  a  group-based  logic  sys¬ 
tem  is  the  familiar  binary  AND-EXOR  logic.  As  a  further 
exploration  of  group-based  logic  systems,  we  observed  that 
Boolean  Ring  addition  operator  has  the  cyclic  group  prop¬ 
erty  that  provides  the  high  testability  in  all  group-based  logic 
systems. 

We  developed  functional  representations  over  Boolean 
Ring  for  2r -valued  functions,  where  r  is  a  positive  integer 
number.  For  our  first  form,  which  we  call  a  GL-BRSOP 
or  Generalized-Literal  Boolean  Ring  Sum-of-Products,  the 
literals  are  found  directly  from  the  truth  table  of  the  MVL 
function.  We  illustrated  the  functional  completeness  of  the 
new  form  with  a  quaternary  example.  To  further  extend  this 
BRSOP  representation,  we  defined  another  form,  which  we 
call  UL-  BRSOP  or  Universal  Literal  Boolean  Ring  Sum- 
of-Products.  The  universal  literals  for  this  form  are  more 
complex  to  generate,  but  they  reduce  the  number  of  product 
terms. 

A  very  important  feature  of  our  BRSOP  representations 
is  that  they  can  be  implemented  with  a  vector  of  binary  AND 
gates  and  a  vector  of  binary  EXOR  gates.  Furthermore,  we 
showed  that  this  implementation  in  space  can  also  be  done 
in  a  multiplexed  form  in  time.  Our  time-multiplexed  imple¬ 
mentation  has  much  less  duplicated  circuitry  and  is  highly 
testable  using  standard  scan  techniques. 

In  summary,  the  major  advantages  of  our  BRSOPs  are: 
they  allow  MVL  functions  to  be  implemented  with  vectors 
of  binary  AND  and  EXOR  gates;  they  allow  the  use  of  a  bi¬ 
nary  logic  fault  model;  and  they  permit  a  time-multiplexed 
implementation  that  is  highly  testable  and  uses  less  redun¬ 
dant  circuitry.  We  are  currently  performing  some  measu- 
ments  with  benchmark  circuits  to  compare  our  scheme  with 
classical  binary  circuits.  We  also  plan  to  do  further  research 
with  our  BRSOPs,  and  apply  the  proposed  techniques  to 
Min-ModSum  and  other  group-based  combinational  logic 
circuits. 
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Abstract 

A  multiple-valued  current-mode  (MVCM)  circuit  based  on 
dual-rail  differential  logic  has  been  proposed  for  high¬ 
speed  arithmetic  systems  at  a  low  supply  voltage.  This 
paper  presents  a  new  totally  self-checking  circuit  based 
on  dual-rail  MVCM  logic ,  where  almost  all  the  basic 
components  except  a  differential-pair  circuit  have  been 
already  duplicated,  which  results  in  small  hardware 
overhead  compared  with  a  non-self-checking  circuit 
based  on  dual-rail  MVCM  logic .  Moreover,  the 
performance  of  the  proposed  self-checking  circuit  is 
superior  to  that  of  full  duplication  of  the  non-self- 
checking  circuit  based  on  dual-rail  MVCM  logic  in  terms 
of  transistor  counts,  switching  delay  and  dynamic  power 
dissipation  under  a  0.5-pm  standard  CMOS  technology 


1.  Introduction 

Fault-tolerance  techniques,  as  well  as  high- 
performance  system  architectures  and  circuit  design 
techniques  [l]-[3],  have  become  increasingly  important  in 
recent  submicron  VLSI  chips.  One  of  the  key 
technologies  for  the  detection  of  faults  inside  VLSI  chips 
is  to  use  a  self-checking  circuit  that  has  the  capability  to 
test  for  the  occurrence  of  a  fault  within  the  circuit  by  a 
normal  input,  as  well  as  to  detect  an  error  at  the  input 
itself.  Furthermore,  by  the  use  of  self-checking 
techniques,  transient  and  permanent  faults  can  be  also 
detected  during  normal  operations  [4],  [5].  However,  a 
self-checking  circuit  based  on  the  conventional  binary 
gates  requires  about  twice  as  many  transistors  as  the 
corresponding  non- self-checking  binary  circuit,  which 
limits  a  wide  use  of  the  self-checking  system  in  the 
present  VLSI  chips. 

On  the  other  hand,  an  MVCM  circuit  based  on  dual¬ 
rail  differential  logic  has  been  proposed  for  high-speed 
arithmetic  systems  at  a  low  supply  voltage.  A  dual-rail 


MVCM  circuit  is  used  to  make  a  signal-voltage  swing 
small  yet  driving  capability  large.  In  fact,  the  use  of  dual¬ 
rail  MVCM  logic  in  a  1.5V-supply  multiple-valued 
multiplier  chip  enables  high-speed  operations  with 
reduced  transistor  and  interconnection  counts  at  low 
power  dissipation  in  comparison  with  that  of  a 
corresponding  binary  CMOS  one  [6],  [7].  Moreover,  in 
the  dual-rail  MVCM  circuit,  there  are  three  basic 
components,  that  is,  an  adder  for  current-mode  linear 
sum,  a  comparator  and  a  differential-pair  circuit  (DPC)  to 
generate  a  dual-rail  multiple-valued  current  output.  These 
first  two  basic  components  except  a  DPC  have  been 
already  duplicated. 

In  this  paper,  a  totally  self-checking  circuit  based  on 
dual-rail  MVCM  logic  is  proposed  to  realize  highly 
reliable  VLSI  systems  with  keeping  high-speed  operation 
capability.  A  new  non-self-checking  (NSC)  DPC  proposed 
here  has  a  redundant  function  because  one  of  the  two 
outputs  depends  on  only  a  single  input  in  the  NSC-DPC. 
Accordingly,  a  self-checking  (SC)  DPC  can  be  designed 
by  the  duplication  of  the  proposed  NSC-DPC.  That  is,  the 
dual-rail  inputs  in  one  NSC-DPC  are  shared  by  the  dual¬ 
rail  inputs  in  the  other  one,  and  where  only  a  single  output 
from  each  NSC-DPC  is  used  as  the  resulting  dual-rail 
outputs  in  the  SC-DPC.  Consequently,  the  use  of  the 
proposed  DPC  makes  it  possible  to  design  a  totally  self¬ 
checking  circuit  with  keeping  high  driving  capability 
based  on  dual-rail  MVCM  logic,  which  results  in  small 
hardware  overhead  compared  with  the  corresponding 
NSC  circuit  based  on  dual -rail  MVCM  logic. 

To  confirm  the  usefulness  of  the  proposed  self¬ 
checking  circuit,  its  performance  is  compared  with  that  of 
the  corresponding  full  duplication  of  a  dual-rail  MVCM 
circuit  in  terms  of  transistor  counts,  switching  delay  and 
dynamic  power  dissipation  under  a  0.5-pm  standard 
CMOS  technology.  As  a  result,  the  number  of  transistors 
and  dynamic  power  dissipation  in  the  proposed  self¬ 
checking  circuit  is  reduced  to  about  67  percent  and  68 
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percent,  respectively,  in  comparison  with  those  of  the  full 
duplication  of  the  dual-rail  MVCM  circuit  under  the  same 
switching  delay. 

2.  Model  of  a  Multiple-Valued  Self- 
Checking  VLSI  System 

In  this  section,  we  describe  an  MVCM  circuit  model 
discussed  here,  and  define  a  totally  self-checking  circuit 
against  a  single  stuck-at  fault  in  the  VLSI  system 

2.1  Basic  components  of  a  multiple-valued  VLSI 


Figure  1  shows  three  basic  components  of  the 
proposed  MVCM  circuit.  By  the  combination  of  these 
components,  we  can  realize  any  multiple-valued 
combinational  circuits.  A  comparator  is  to  compare  an  R- 
valued  input  X  with  a  threshold  value  T ,  and  to  generate  a 
binary  output  G  where  X,  T e  R  =  {0, 1,  •••,  /?  —  1}  and 
G  e  {0, 1} .  If  X  is  less  than  7,  G  becomes  0.  Otherwise,  G 
becomes  1 .  A  DPC  is  to  generate  one  of  three  multiple¬ 
valued  differential-pair  outputs  (0,  0),  (K,  0)  and  (0,  K)  in 
accordance  with  binary  differential-pair  inputs  where  its 
function  is  defined  in  Figure  1 .  An  adder  forms  arithmetic 
sum  of  two  multiple-valued  inputs  A  and  B  where 
A,  B  €  R  .  For  example,  Figure  2  shows  a  block  diagram 
of  an  MVCM  circuit  whose  function  is  given  as 


1  (. X>T ), 

0  (X<T) 


2.2  Definition  of  a  totally  self-checking  circuit 

Self-checking  can  be  defined  as  the  ability  to  verify 
automatically,  whether  there  is  any  fault  in  logic  without 
the  need  for  externally  applied  test  stimuli.  We  will 
consider  only  malfunctions  in  the  circuit  caused  by  single 
faults.  The  concept  of  a  totally  self-checking  circuit, 
which  can  generate  a  detectable  erroneous  output  for 
every  fault  from  prescribed  set  during  normal  operations, 
is  defined  in  the  following  [4]: 

Fault  Secure  (FS):  A  circuit  is  fault  secure  if,  for  every 
fault  from  a  prescribed  set,  the  circuit  never  produces  an 
incorrect  code  output  for  code  inputs. 

Self  Testing  (ST):  A  circuit  is  self-testing  if,  for  every 
fault  from  a  prescribed  set,  the  circuit  produces  a  non¬ 
code  output  for  at  least  one  code  input. 

Totally  Self-Checking  (TSC):  A  circuit  is  totally  self¬ 
checking  if  it  is  both  self-checking  and  fault  secure. 

Generally,  a  duplicated  circuit  is  TSC.  Figure  3  shows 
a  self-checking  circuit  which  has  a  fully  duplicated 
structure  of  a  single-rail  MVCM  circuit  shown  in  Figure 
2.  Therefore,  its  hardware  becomes  twice  as  many  as  that 
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Figure  1.  Definition  of  basic  components  in  an  MVCM  circuit. 


Figure  2.  Single-rail  MVCM  threshold  detector. 


Figure  3.  Self-checking  circuit  with  a  fully  duplicated 
structure  of  two  single-rail  MVCM  threshold 
detectors. 


of  the  circuit  shown  in  Figure  2.  In  this  self-checking  R- 
valued  logic  circuit,  the  code  words  and  non-code  words 
are  shown  in  Table  1 .  The  dual-rail  code  words  satisfy  the 
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Table  1.  5-valued  dual-rail  code. 


dual-rail 

Logic  Value 

l.  X 

(*„*„) 

0 

(0,4) 

“0”  > 

1 

(1,3) 

2 

(2,  2) 

“2” 

>-  Code  word 

3 

(3,  1) 

“3” 

4 

(4,0) 

“4”  J 

- 

otherwise 

Non-code  word 

following  relation. 

X1+X0=/?-l  (2) 

2.3  High-performance  self-checking  circuit 

based  on  dual-rail  MVCM  logic 

In  order  to  improve  the  performance  of  the  single-rail 
MVCM  circuit  shown  in  Figure  3,  DPCs  must  be 
controlled  by  dual-rail  complementary  inputs.  Figure  4 
shows  a  block  diagram  of  a  self-checking  circuit  using  a 
dual-rail  MVCM  circuit.  Its  function  is  the  same  as  that 
described  by  Eq.  (1).  In  this  circuit,  its  switching  speed 
becomes  higher  than  that  of  the  self-checking  circuit 
based  on  single-rail  MVCM  logic  as  shown  in  Figure  3 
because  of  the  large  driving  capability  of  DPC.  But  it 
requires  twice  as  many  input  nodes  and  comparators  as 
the  corresponding  single-rail  MVCM  circuit,  which 
causes  serious  hardware  overhead. 

Figure  5  shows  a  block  diagram  of  the  proposed  self¬ 
checking  circuit  based  on  dual-rail  MVCM  logic.  Since 
the  functions  of  two  comparators,  COO  and  C01,  are  same 
as  two  comparators,  Cll  and  CIO,  respectively,  they  can 
be  shared  to  Cll  and  CIO  as  shown  in  Figure  5. 

Now,  let  us  discuss  about  the  self-checking  ability  of 
Figure  5.  The  DPC  has  a  redundant  function,  because  it 
depends  on  only  a  single  input  G1  as  shown  in  Figure  1. 
As  an  example,  Figure  6  shows  a  block  diagram  of  the 
proposed  self-checking  circuit  with  a  stuck-at-1  fault  at 
the  output  Gx  of  the  comparator  Cll.  The  output  of  the 
comparator  Cll  is  connected  to  Gn  and  Gqq.  However,  the 
output  Z0  in  the  DPC2  depends  on  only  G0l ,  so  that  DPC2 
operated  correctly  even  when  the  S-a-1  fault  happens  at 
Goq.  In  Ais  way,  even  if  one  comparator  fails  in  the  circuit 
of  Figure  5,  it  satisfies  the  self-checking  ability. 


Figure  4.  Duplicated  dual-rail  MVCM  threshold  detector  with 
complementary  logic. 


Figure  5.  Proposed  dual-rail  MVCM  threshold  detector. 


Figure  6.  Proposed  dual-rail  MVCM  threshold  detector  with  a 
stuck-at-1  fault. 
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Figure  7.  Behavior  of  the  proposed  DPC. 
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Figure  9.  Basic  building  blocks. 


lower  than  that  of  the  nMOS  transistor  M2.  The  lower 
threshold  voltage  increases  current  through  transistor. 
Using  this  characteristic  realizes  the  selective  switching 
of  Ml  when  both  gate  voltages  of  Ml  and  M2  are  same 
each  other  by  just  three  transistors. 

Figure  8  shows  the  HSPICE  simulation  of  a  DPC.  The 
simulation  clearly  demonstrates  that  the  expected  correct 
code  outputs  (K,  0)  are  appeared  when  their  inputs  are 
(1,1).  Figure  9  shows  the  relationship  between  the  basic 
components  and  their  circuit  diagrams.  Current  mirror 
creates  replicas  of  an  input  X.  Current  source  creates  a 
constant  value  K .  MVCM  circuits  consist  of  these  five 
components  shown  in  Figure  9. 


Figure  8.  Simulated  waveforms  of  the  circuit  of  Figure  7. 

3.  Design  of  a  Multiple-Valued  Self- 
Checking  Circuit 

Figure  7  shows  the  proposed  DPC  of  Figure  1.  The 
threshold  voltage  of  the  nMOS  transistor  Ml  becomes 


4.  Evaluation 

Figure  10  shows  a  circuit  diagram  of  the  proposed  self¬ 
checking  circuit  shown  in  Figure  5.  The  output  of  the 
comparator  is  distributed  to  two  DPCs,  so  that  a  compact 
self-checking  circuit  can  be  designed  because  of  no 
additional  comparators  for  dual-rail  MVCM  logic. 

Table  2  summarizes  the  comparison  of  performances 
among  three  kinds  of  MVCM  circuits  using  a 
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Table  2.  Comparison  of  three  MVCM  circuits. 
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Figure  10.  Self-checking  MVCM  threshold  detector. 
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circuit 
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duplicated 

MVCM 

Circuit 

Proposed 

MVCM 

I;  circuit 

TSC 

No 

Yes 

Yes 

Supply 
Voltage  [V] 

1.5 

1.5 

1.5 

Delay  [ns] 

3.5 

3.5 

3.5 

Power 

dissipation 

[pW] 

8.3 

16.7 

11.4 

Number  of 
:  Transistors 

9 

18 

12 

delay  of  the  transistor  in  comparison  with  that  of  a  normal 
transistor. 

As  a  future  problem,  it  is  also  important  to  design  a 
DPC  with  small  overhead  of  a  switching  delay  and 
keeping  small  transistor  counts. 


conventional  non-self-checking  design,  a  fully  duplicated 
design,  and  the  proposed  design,  respectively.  The  use  of 
the  proposed  NSC-DPC  makes  it  possible  to  reduce  the 
number  of  comparators  in  a  self-checking  circuit.  As  a 
result,  in  case  of  the  same  switching  delay,  the  power 
dissipation  and  the  number  of  transistors  of  the  self¬ 
checking  circuit  using  the  proposed  NSC-DPCs  are 
reduced  to  about  68  percent  and  67  percent,  respectively, 
in  comparison  with  those  of  the  corresponding  full 
duplication  of  an  NSC  dual-rail  MVCM  circuit  under  a 
0.5-fim  standard  CMOS  technology. 

5.  Conclusion 

A  new  compact  NSC-DPC  based  on  dual-rail  MVCM 
logic  has  been  proposed,  and  is  useful  for  the  realization 
of  a  totally  self-checking  system  with  keeping  high  speed 
operations  at  a  lower  supply  voltage.  In  fact,  its 
performance  is  superior  to  that  of  the  equivalent  full 
duplication  of  an  NSC  dual-rail  MVCM  circuit  in  terms 
of  transistor  counts  and  power  dissipation  under  the  same 
switching  delay. 

Although  the  use  of  the  transistor  with  a  higher 
threshold  voltage  makes  it  possible  to  design  a  compact 
DPC  with  a  redundant  function,  it  increases  the  switching 
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Abstract 

Following  T.  Y.  Lin  vague  concepts  such  as  “high  ”  or 
“small”  (“amount  of  money  ”  as  linguistic  variable)  should 
be  described  by  classes  of  “ equivalent  ” fuzzy  sets  on  a  fixed 
suitable  universe  U.  Furthermore  LIN  proposed  to  generate 
such  classes  by factorization  of the fuzzy  power  set  FF  {U)  of 
U  with  respect  to  a  given  equivalence  relation  «  on  FP((7). 
By  three  examples  we  show  that  this  approach  is  not  suc¬ 
cessful  if  we  intend  to  shift  algebraic  operations  with  fuzzy 
sets  to  classes  generated  by  equivalence  relations.  Using  a 
KRlPKE-sfy/e  semantics  approach  we  can  introduce  a  sensi¬ 
ble  concept  of  a  qualitative  fuzzy  set  and,  furthermore,  sen¬ 
sible  algebraic  operations  with  such  sets .  The  concept  of 
multi-fuzzy  set  introduced  by  Lin  is  a  special  case  of  our 
concept  of  context-depending  fuzzy  set. 

Keywords:  Fuzzy  sets,  vague  concepts,  qualitative  fuzzy 
sets,  multi- fuzzy  sets. 

1.  Introduction 

We  start  with  recalling  some  mathematical  notions  and 
notations. 

By  1R  and  [0, 1]  we  denote  the  set  of  all  real  numbers  and 
the  set  of  all  real  numbers  r  with  0  <  r  <  1,  respectively. 

Let  U  be  a  fixed  non-empty  set  called  universe.  The 
power  set  of  U  is  denoted  by  P(l/).  Furthermore,  we  will 
use  the  usual  denotations  of  crisp  set  theory,  for  instance, 
Ac\B,AkjB,  and  AxB  for  the  intersection,  union,  and  the 
CARTESian  product  of  the  crisp  sets  A  and  B ,  respectively. 
Especially,  by  An  we  denote  the  «-fold  CARTESian  product 
ofA(n>l,n  integer). 

In  the  following  we  shall  use  the  well-known  A -operator. 
Assume  A,  B ,  and  C  are  crisp  sets  and  /  :AxB-+C.  Then 

*This  research  was  supported  by  the  Deutsche  Forschungsgemeinschaft 
as  part  of  the  Collaborative  Research  Center  “Computational  Intelligence” 
(SFB  531) 


Xa f(a,  b )  denotes  the  mapping  <p  :  A  ->  C  that  is  defined  by 

<p(a)=deff(.a,b ) 

where  b  e  B  is  fixed  and  a  runs  through^.  The  mapping 
Xbf(a ,  b)  is  defined  analogously. 

A  fuzzy  set  F  on  U  is  a  mapping  F :  U  -» [0, 1],  i.  e.  we  do 
not  distinguish  between  a  fuzzy  set  F  on  U  and  its  member¬ 
ship  function  pf.  The  set  {F|F :  U  -»  [0, 1]}  is  called  fuzzy 
power  set  of  U  and  denoted  by  FP (U).  For  C  c  [0, 1  ]  we  de¬ 
fine  F~l(C)  =def  {x\x  e  U  aF(x)  e  C }. 

An  n-ary  operation  (n  >  1,  n  integer)  on  [0, 1]  is  a  map¬ 
ping  op  :  [0,  If  ->  [0, 1]  where  [0, 1]"  denotes  the  n-fold 
CARTESian  product  of  [0, 1]. 

As  usual  we  “shift”  a  given  operation  op  to  an  operation 
Op  for  fuzzy  sets  Fj, ...  ,Fn  on  U  as  follows  where  x  e  U: 

Op(F] ,...,  F„)(x)  -def  op(F\  (x) . F„(x)). 

In  daily  life  we  can  come  across  a  lot  of  “computing 
rules”  having  the  form 

“high  plus  small  is  high” 
or 

“many  minus  small  is  many”. 

Such  rules  can  be  denoted  as  rules  for  computing  with 
“vague  quantities”  or  with  “fuzzy  qualities”  or,  interpret¬ 
ing  proposals  made  by  L.  A.  Zadeh  [30,  32,  34,  35],  as 
rules  for  “computing  with  words”.  In  the  papers  [24, 25]  we 
have  called  such  rules  “computing  with  words  in  the  narrow 
sense”  while  by  “computing  with  words  in  the  wide  sense” 
we  understand  the  use  of  vague  concepts  in  (approximate) 
reasoning,  in  modelling,  and  in  programming. 

In  the  framework  of  the  usual  fuzzy  arithmetics  a  rule  of 
the  form  “high  plus  small  is  high”  should  be  interpreted  and 
evaluated  as  follows: 

1 .  Take  as  the  universe  U  the  set  IR  of  all  real  numbers 

2.  interpret  the  words  “high”  and  “small”  by  real  fuzzy 
numbers  d  and  cr,  respectively 
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3 .  interpret  the  word  “plus”  by  an  addition  +  for  real  fuzzy 
numbers,  for  instance,  if  a  and  J3  are  arbitrary  fuzzy 
numbers  then  +  is  defined  by 

(a  +  p)(x)  =def  sup  {a(y)+P(z)\y,z  e  JRax  =y+z} 
for  every  x  e  IR 

4.  interpret  the  word  “is”  by  an  identity  relation  =  for  real 
fuzzy  numbers 

5.  make  sure  that  the  equation  0 + a  =  0  holds. 

Now,  we  have  to  state  that  the  considerations  above 
highly  depend  on  the  context.  Take,  for  instance,  the  lin¬ 
guistic  variable  “amount  of  money”  and  for  this  variable  the 
linguistic  terms  “high”  and  “small”.  Then  for  an  oil  sheik 
or  “Deutsche  Bank”  the  words  “high”  and  “small”  are  to  be 
interpreted,  for  instance,  by  real  fuzzy  numbers  describing 
an  amount  of  about  1  billion  dollars  and  500  thousand  dol¬ 
lars  (“peanuts”),  respectively,  while  for  an  unemployed  per¬ 
son  these  words  are  to  be  interpreted,  for  instance,  by  real 
fuzzy  numbers  describing  an  amount  of  about  500  dollars 
and  2  dollars,  respectively. 

From  the  discussion  above  we  derive  the  following  state¬ 
ments  and  claims:  Assume  we  have  fixed  U  as  IR  and  the 
domain  of  all  real  fuzzy  numbers  as  possible  interpretations 
of  the  words  “high”  and  “small”.  Furthermore  assume  that 
we  have  fixed  an  interpretation  of  the  words  “plus”  and  “is”. 
Then 

1.  Consider  all  interpretations  of  the  words  “high”  and 
“small”  such  that  die  equation 

“high  plus  small  is  high” 

holds. 

2.  Consider  all  such  interpretations  of  the  word  “high”  as 
a  class  of  “equivalent”  (A.  Kandel  uses  the  word  “ad¬ 
missible  function”  [7]  while  T.  Y.  Lin  prefers  to  speak 
about  “equivalent”  fuzzy  sets  [12])  real  fuzzy  numbers 
and  call  this  class  a  “qualitative”  real  fuzzy  number  and 
consider  this  class  as  a  “qualitative”  interpretation  of 
the  word  “high”.  Do  the  same  for  the  word  “small”. 

3.  We  intend  to  develop  a  “qualitative”  interpretation,  il¬ 
lustrated  by  the  equation 

“high  plus  small  is  high” 

as  follows: 

3.1.  Interpret  the  words  “high”  and  “small”  by  classes 
of  “equivalent”  real  fuzzy  numbers. 

3 .2.  Define  an  addition  and  an  identity  for  classes  such 
that  the  equation  above  holds. 

On  the  basis  of  our  knowledge  from  classical  algebra  one 
could  think  that 


•  the  “equivalence  classes”  could  be  generated  by  equiv¬ 
alence  relations  for  fuzzy  sets 

•  the  operations  could  be  defined  as  for  factor  structures 
in  classical  algebra. 

In  the  next  chapter  we  show  by  three  examples  that  this 
approach  is  not  successful,  in  general. 


2.  Three  counterexamples  for  the  classical 
“factorization”  approach 


Counterexample  1 


For  definiteness  we  recall  some  notions  and  notations  of 
the  general  topology,  but  in  a  simplified  form  and  adapted  to 
the  given  case. 

By  CLOS([ 0, 1])  we  denote  the  set  of  all  sets  Cc[0,l] 
that  are  closed  with  respect  to  the  usual  topology  in  [0, 1]. 

Assume  T  c  P((7). 

Definition  2.1 

T  is  said  to  be  a  topology  on  U 

=def  1.  Ve(6cT->f|6eT) 

2.  VSVT(S,Te  <Z->SuT  e  T) 


Assume  F  :  U  [0, 1].  Then  one  defines  the  topology 
TOP(F,  U)  that  is  generated  by  F  on  U  as  follows. 

Definition  2.2 


TOP(F,  U) 


L1UU  4.4  / 

?.U)=def  n{- 


{F_1(C)|C  e  CLOS([0, 1])}  i 
aT  is  a  topology  on  U 


Obviously,  TOP(F,  U)  is  the  “smallest”  topology  T  on  U 
such  that  the  function  F  is  continuous  with  respect  to  T. 


Assume  h  :  [0, 1]  — » [0, 1]. 

Definition  2.3  (see  Lin  |12]) 
h  is  said  to  be  a  relative  self-homeomorphism  of  [0, 1] 
~def  1.  his  a  bijection  between  [0, 1]  and  [0, 1] 

2.  h  and  h~l  are  continuous 

3 .  h(0)  =  0  and h(\)=  1. 


Assume  F,G:U-+[  0, 1]. 

Definition  2.4  (see  Lin  [12]) 

F  »G 

=i/  1.  TOP{F,  U)  =  TOP(G,  U)  and 

2.  there  exists  a  self-homeomorphism  h  such  that  for 
every  x  e  U 

F(x)  -  h(G(x)). 

Obviously,  «  is  an  equivalence  relation  on  FP(t/).  Hence 
we  can  construct  the  set  FP(i7)/«  of  “LiN-equivalence 

L 

classes”  with  respect  to  w.  We  call  classes  £  e  FP  (£/)/« 

L  L 

“LiN-qualitative  fuzzy  sets”. 
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Now,  we  shall  show  that  very  important  operations  with 
fuzzy  sets  F,Ge  FP(U)  cannot  be  shifted  to  operations  with 
LiN-qualitative  fuzzy  sets. 

Consider  the  Lukasiewicz  disjunction  veil  defined  by 
veli(r,  s)  =  min(l,  r+s) 

for  r,s  e  [0, 1].  Furthermore,  consider  the  operation  Veit  for 
F,G  e  FP(fT)  defined  by 

Veli(F,  G)(x)  =def  min(l,  F(x)  +  G(x)) 

for  x  e  U. 

Now  we  construct  a  universe  U  and  fuzzy  sets  F,  G,  and 
H  on  U  such  that 

1.  F  tsG  but 

L 

2.  not  Velt(F,H)  «  Veli(G,H). 

This  result  means  that  the  operation  Veil  which  is  very  im¬ 
portant  for  studying  MV-algebras  cannot  be  shifted  from 
fuzzy  sets  on  U  to  LiN-qualitative  fuzzy  sets  on  U,  i.  e. 
classes  from  FP([/)/»  • 

L 

To  prove  this  result  we  put  U  =def  {a,b,c}  with 
a&bAb&CAC&a, 


For  u  =  b  we  have 

Velt(G,H)(b)=\ 
h'(Veh(FH)(b))  =  h'(l)=l, 
hence  we  get  the  contradiction 


Counterexample  2 

One  could  think  that  counterexample  1  works  because  we 
have  chosen  the  function  veil  ■  In  the  following  we  are  going 
to  show  that  the  same  holds  if  we  replace  veil  by  the  essen¬ 
tially  simpler  function  min.  We  put 

U  =def  {a,b,c,d}  where a*b,  b*c,c*d,  and d^a 
F(a)  -def  0  G(a)  -def  0  H(a)  =def  0 

F(b)  ~def  Jo  G(b)  —def  ^  H(F)  —def  TO 

F(c)  —def  TO  G(c)  —def  TO  H{c)  ~def  TO 

F{d)=def\  G(d)=def  1  H{d)=def\ 

Then  we  get 

TOP(F)  =  TOP(G)  =  P(t/). 


F{d)-def  o 
F(b)=def  | 
F(c)  -def  1 


G(a)  -def  0 
G{b)=def\ 
G(c)  —def  1 


H(a)=def  0 

Hib)=def\ 

H(c)=def  1 


h(r)  -def 


3  r  if  0  <  r  <  | 
5  + 1  if  |  <  r  <  1 


Then  by  definition  we  get 

P ((/)  =  TOP(F,  U)  =  TOP(G,  U). 

Furthermore,  we  have  that  the  function  h  defined  above  is  a 
relative  self-homeomorphism  of  [0, 1]  and 

Vw(w  €  U  F(u)  =  h(G(u))). 

Hence,  we  have  F  «  G, 

Now,  we  get 

Fe/L(F,/f)(b)  =  min(l,F(b)+ff(b))  =  min(l,|  +  I)  =  1 
Veli(G,H)(b)  =  mind,  G(b) +H(b ))  =  min  (l,  | 

Now,  we  assume 


VelL(FfH)*Velt(G,H), 


so  we  get 


VelL(G,H)*VelL(F,H), 

hence  there  exists  a  relative  self-homeomorphism  h'  of  U 
such  that 


Furthermore,  it  is  trivial  that  there  exists  a  self- 
homeomorphism  h  of  [0, 1]  such  that  for  every  jc  e  U  the 
equation  F(x)  =  h(G(x ))  holds,  hence  F  «  G. 

Now,  we  obtain 

min(F(a),  H(a))  -  0  min(G(a),  H{a))  =  0 
min (F(b),H{b))  =  ^  min (G(b),H(b))  =  & 

min (F(c),  H(c))  =  ^  min(G(c),  H{c))  =  ± 

min(F(d),  H(d))  =  1  min(G(</),  H(d))  =  1 

Obviously,  we  get  Min{FyH)  V  Min(G,  H)  because  there  is 

no  self-homeomorphism  h  such  that  for  every  x  e  U  the 
equation  Min(Fp  H){x)  -  h(Min(G,  H)(x))  holds. 

Counterexample  3 

Now,  we  discuss  an  example  which  is  more  lucid  than  the 
examples  1  and  2  coming  from  Lin’s  concept  of  a  qualita¬ 
tive  fuzzy  set. 

Our  example  will  very  instructively  demonstrate  that  the 
concept  of  qualitative  fuzzy  set  based  on  an  equivalence  re¬ 
lation  *  on  FP(L0  is  not  suitable  for  constructing  qualitative 
fuzzy  sets  by  “factorization”  with  respect  to  « in  the  sense  of 
classical  algebra. 

Assume  that  the  universe  used  is  the  set  IR  of  all  real  num¬ 
bers,  i.  e.  we  put 

U  — def  IR- 


V«(w  e  U  ->  Vek(G,H)(u)  =  h'(Vek(F,H)(u))).  Assumed  and  G  are  fuzzy  sets  on  U,  i.  e.  F,  G :  IR-»  [0, 1]. 
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Definition  2.5 

F&G  =def3c(c  e  IRa Vx(x  e  IR -> F(x)  =  G(x+c ))) 

This  definition  means  that  F  and  G  are  equivalent  if  and 
only  if  they  are  congruent  modulo  a  translation  along  the  real 
axis. 

Obviously,  *  is  an  equivalence  relation  on  FP(IR). 

Now,  for  arbitrary  fuzzy  sets  F  and  H  on  IR  define  the 
operation  Min(F,  H)  as  follows  where  x  e  IR 

Min(F,  H){x)  =&/  min(F(x),  H(x)). 

Then  the  following  picture  shows  that  F  »  G  does  not  imply 
Min(F,  H)  m  Min(G,  H),  in  general. 


Obviously,  we  have  F  «  G,  but  Min(F,H)  *  0  and 
Min(G,  H)  =  0,  hence  Min(F,H)  *JMin(G,H). 

3.  A  KRlPKE-style  semantics  approach  to  a 
sensible  concept  of  qualitative  fuzzy  sets 

Let  W  be  a  non-empty  set  of  possible  worlds. 

Definition  3.1 

<1>  is  said  to  be  a  context-depending  fuzzy  set  on  U  with  re¬ 
spect  to  W 

=defd>:WxU^>[ 0,1] 

At  the  first  glance  a  context-depending  fuzzy  set  on  U 
with  respect  to  W  can  be  considered  as  a  usual  fuzzy  set  on 
the  CARTESian  product  W  x  JJ  or  as  a  binary  fuzzy  relation 
between  the  sets  W  and  U. 

The  key  point  of  our  following  approach  consists  of  using 
a  function  <!> :  W  x  U  ->  [0, 1]  for  describing  a  “vague  con¬ 
cept”  on  U,  for  instance,  the  vague  concept  “high”  (amount 
of  money). 

If  we  put  W  =def  { 1, . .. ,  n)  (n  >  1,  n  integer)  then  our  con¬ 
cept  of  a  context-depending  fuzzy  set  d> :  W  x  U  ->  [0, 1]  (on 
U  with  respect  to  W )  coincides  with  Lin’s  concept  of  multi- 
fuzzy  set  (see  [14]). 

Strictly  speaking,  a  given  context-depending  fuzzy  set 
®  :  WxU  -*  [  0,1]  generates  a  class  <£$  of  “equivalent” 
fuzzy  sets  (or  “admissible”  fuzzy  sets)  on  U  as  follows. 
Definition  3.2 
<Lt,  =def{ Aw®(w,tt)|w  e  W} 

Now,  we  consider  the  fuzzy  power  set  FF(W  x  U).  Using 
the  concepts  introduced  by  definition  3.1  and  3.2  we  intro¬ 
duce  a  new  “factorization”  of  FP(l/)/jp  of  the  “classical” 
power  set  FF(U)  as  follows. 


Definition  3.3 

FP  (U)/W  =def{U\®-WxU^[0,\}) 

Comparing  the  construction  FF(U)/jy  with  the  classical 
construction  FP  (£/)/«  where  ~  is  an  equivalence  relation  on 
FP((7)  we  can  state: 

While  FP(£/)/*  is  a  partition  of  FP(t/)  we  are  faced  with 
the  following  two  important  facts: 

1.  FP  (U)/jy  is  not  a  covering,  in  general 

2.  FP (U)/fp  is  not  disjoint,  in  general. 

The  fact  that  FF(U)/jy  is  not  a  covering  must  be  inter¬ 
preted  so  that  there  exists  a  fuzzy  set  F  e  FP (U)  which  is 
not  an  “admissible”  fuzzy  set  for  any  class  €  e  FP  (U)/\y, 

i.  e.  which  is  not  an  “admissible”  fuzzy  set  for  any  “vague 
concept”  O :  W  x  U  -» [0, 1  ],  i.  e.  for  an  arbitrary  “vague  con¬ 
cept”  ®  :  Wx  U  -» [0, 1]  there  is  no  world  w  e  W  such  that 
F  =  Am®(m',u). 

The  fact  that  FP (U)/pr  can  be  not  disjoint  means  that 
there  can  exist  “vague  concepts”  ®  :  W  x  U  -»  [0, 1]  and 
'¥:WxU-+  [0,1]  such  that 

1 .  C<b 54  and 

2.  €<j)  o  <£>p  ^  0. 

From  conclusion  1  we  get  that 

3.  3iv(w  e  W  a  Vw'(w'  e  W  ->  A.m®(m>,  u)  *  Xu'¥(w',  u))) 
or3w'(w'  €  WA\/w(weW->k m^w', u)*X u®(w, u))) . 

From  conclusion  2  we  get  that 

4.  3w3v/(w,  w'eW a Xu<t>(w,  u)  =  AuvF(w', «)). 

If  we  interpret  the  vague  concept  “high”  (amount  of 
money)  by  ®  and  the  vague  concept  “small”  (amount  of 
money)  by  4/  then  the  condition  £$  ^  means  that  these 
vague  concepts  are  different. 

Furthermore,  the  conclusion  4  could  be  interpreted  that 
the  same  fuzzy  set  F  e  FF(U)  for  an  unemployed  person 
w  describes  a  high  amount,  i.  e.  F  =  Xud>(w,  u),  while  for  an 
oil  sheik  w'  the  same  fuzzy  sets  describes  a  small  amount  (of 
money),  i.  e.  F  =  /.u'¥(w',  u),  for  instance. 

Now,  we  shall  discuss  how  operations 

Opn  :  FP (U)n  FP(t/)  (n  >  1 ,  n  integer) 

can  be  “lifted”  to  operations 

OP*  :  {FF(U)/W)n  FF{U)/W. 

Using  the  terminology  and  “philosophy”  of  classical  algebra 
in  constructing  factor  structures  from  given  algebraic  struc¬ 
tures  (groups,  rings,  lattices, ...  )  for  ®,  ¥ :  W  x  U  [0, 1] 
and  £  e  FP  (U)/yp  we  define: 
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Definition  3.4 

1.  <t>  is  said  to  be  a  representative  of  the  class  € 

~def  &  ~  *£<J> 

2.  <1>  and  'V  are  said  to  be  equivalent  (shortly: 

~def  =  &P 

Proposition  3.1 

The  relation  « is  an  equivalence  relation  on  FP (W  x  U). 
Now  we  are  going  to  discuss  how  operation 
OP"  :  FP(£/)"  ->  FP(C/) 
can  be  lifted  to  operations 

OPn:(FF(U)/w)n^mU)/w 

Assume  e  FP  (t/)/ff  and 

d>i, ...  ,<E>„  :  W  x  U  ->  [0, 1]  are  representatives  of  these 
classes,  i.  e.  we  have 

£i  =  Co,,...,  C»  =  C<v 

Now,  using  the  “philosophy”  of  classical  algebra  we  try  to 
define  the  result 

. «.) 

of  the  operation  OF1  applied  to  the  arguments  £  \ , . . . ,  <£„  by 
using  the  representatives  O  j , . . . ,  O*  and  the  operation  Opn. 
From  the  classical  algebra  we  know  the  important  claim  that 
we  have  to  prove  the  independence  of  0P”(<£i , . . . ,  £„)  from 
the  choice  of  the  representatives  d>i, ...  ,d>„  of  the  classes 

It  is  surprising  and  for  the  following  very  important  that 
this  assertion  can  be  proved  for  1-ary  operations 

<9/?:FP(£/)-»FP(t/) 

without  any  difficulties  and  additional  assumptions. 

In  contrast  to  this  fact  in  the  w-ary  case  for  n  >  2  (n  inte¬ 
ger)  we  are  faced  with  serious  difficulties  that  can  be  over¬ 
come  only  by  additional  assumptions,  in  general. 

We  underline  that  this  situation  as  a  matter  of  principle 
differs  from  the  situation  which  we  know  from  the  “factor¬ 
ization  theory”  in  classical  algebra. 

The  case  of  1-ary  operations 

Assume 

Op  :  FP(C/)  — »  FP(£/), 

£  e  FP  (TJ)/W, 

with  C  =  Co- 

Definition  3.5 

OP0(<t)  ~def  {Op(XuQ>(w,  U))\W  E  Wj 
Theorem  3.2 

1.  OPd>(£)  e  F V(U)/W 

2.  IfU  =  £y  then  OP0(£)  =  OPy(€). 


Remarks 

1 .  Theorem  3 .2  means  that  OP^(€)  is  independent  of  O  if 
£==£*. 

2.  OPq)(£)  can  also  be  considered  as  a  mapping  from 
FP (W  x  £/)/«  into  FP(IF  x  U)fa . 

The  case  of  n-ary  Operations  Op  :  FP(C/)"  -»  FP(C7) 
where  n  >  2,  n  integer 

Without  loss  of  generality  we  can  assume  that  n  =  2. 
Furthermore,  we  assume 

Op  :  F T(U)  x  FP(£7)  ->  FP (*7), 
<Lh€2e¥lP(U)/w, 

C^Cop^Co* 

Definition  3.6 

OP&lt4>2(£iX2)=def{Op(hu<$i(w,u\hu<&2(wfu))\w  e  W} 

Theorem  3.3 

1.  OPt p,,<j>2(£i,  £2)  e  ¥¥(U)/W 

2.  The  implication 

If  and  C<d2  =  £vp2  then 

OP®lt(i>2(£\,  £2) =  OP¥{y2(<£\,  £2) 

does  not  hold ,  in  general 

Now,  we  are  going  to  introduce  an  additional  assumption 
such  that  theorem  3.3  holds. 

From  the  assumptions 

Co, 

and 

£<d2  =  ^2 

we  obtain  (see  the  proof  of  theorem  3 .2)  that  there  exist  map¬ 
pings  a2,fh.  '.W  ->W  such  that  for  /e  {1,2}, 

Vw(w  eW->  Aw<D,(w,  u)  =  A wvFj(a/(w),  u)) 
and 

Vw'(w'  <=W->  A u%(wf,  u)  =  A«0/(j3/(wf),  u)) 

hold. 

Now,  we  assume  that  the  mappings  described  above  can 
be  chosen  such  that  cq  =  a2  and  (i\  =  f32 ,  i.  e.  we  define 

Definition  3.7 

[<E>i,4'i]  and  [$2*^2]  have  a  common  Skolem ization 
~def  There  are  mappings  a,  (5  :  W  ->  W  such  that  for 
i  e  {1,2}, 

Vw(w  e  W  ->  A  ud>i(w,  u)  =  A«vF/(a(w), w)) 
and 

Vw'(w'  E  W-*  A«'FI(w',n)  =  A«0/(i3(vv,),M)) 
hold. 


286 


Then  we  have 

Theorem  3.4 

Ifl.  Ci  =  £<pj  =  Cvp,  and  C2  =  C<p2  =  C^2 

2.  [Oij'Fi]  and[®2,¥2] haveacommon  SKOLEMizafion 
then  OP<t>lt<b2(Ci,  C2)  =  OPylty2(<£\,  C2). 
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Abstract 

This  paper  describes  some  classes  of  fuzzy  information 
granularity  and  their  representation  methods.  Fuzzy 
information  granularity  introduced  by  Zadeh  is  that  " 
granularity  relates  to  clumpiness  of  structure ,  while 
granulation  refers  to  partitioning  an  object  into  a  collection 
of  granules,  with  a  granule  being  a  clump  of 
objects(points)  drawn  together  by  indistinguishability , 
similarity ,  proximity ,  or  functionality. "  In  this  paper  we 
show  three  classes  of  granularity  structures ,  which  are 
called  Kleene  class,  Lukasiewicz  class  and  probabilisticlike 
class.  Their  meaning  and  representation  methods  are 
discussed.  Their  examples  are  also  demonstrated.  This 
paper  would  be  a  basis  to  research  on  the  representation  of 
fuzzy  information  granularity. 

1.  Introduction 

A  concept  that  plays  a  pivotal  role  in  fuzzy  logic[l]-[4]  is 
that  of  fuzzy  information  granulation[l].  In  crisp 
information  granulation,  the  granules  are  crisp,  while  in 
fuzzy  information  granulation  the  granules  are  fuzzy. 
Granularity  relates  to  dumpiness  of  structure,  while 
granulation  involves  decomposition  of  the  whole  into  the 
parts,  i.e.,  an  object  into  a  collection  of  granules.  The 
granule  is  a  clump  of  objects  drawn  together  by 
indistinguishability,  similarity,  proximity,  or 
functionality.  Moreover,  each  fuzzy  attribute  is  associated 
with  a  set  of  fuzzy  values.  Our  departure  point  of  this 
study  is  that  for  a  fuzzy  granularity  we  will  clarify  the 
structure  of  the  dumpiness  by  representing  it  by  fuzzy 
logic  operators.  By  exploring  it,  we  would  be  form  the 
basis  of  understanding  the  dumpiness,  classify  it  and  then 
represent  it. 

This  paper  describes  three  classes  of  fuzzy  information 
granularity  (FIG  for  short)  and  their  representation 
methods.  First,  the  three  classes  are  called  by:  Kleene[5] 
class,  which  can  be  expressed  by  maximum,  minimum  and 
not  operations;  Lukasiewicz  class,  which  can  be  expressed 
by  bold  union,  bold  intersection^]  and  not;  Probability 
class,  which  can  be  expressed  by  probabilisticlike  sum, 
probabilisticlike  product  and  not.  Second,  as  a 
representative  of  Kleene  class,  we  introduce  fuzzy  logic 
functions[6]-[10],  which  is  a  model  Kleene  algebra[5]-[12]. 
The  granularity  expressed  by  fuzzy  logic  functions  are 
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shown  and  then  an  application  to  image  granulation  is 
demonstrated.  Third,  Lukasiewicz  class  is  introduced  as  the 
class  that  can  express  FIG  including  information  specified 
by  standard  arithmetic  sum  and  subtraction  calculation.  As 
the  application,  we  show  the  granulation  of  human  brain 
MR  image[14][15][16].  Finally,  Probability  class  is 
introduced  as  the  class  of  FIG  including  probabilistic 
events.  This  paper  is  organized  below.  Section  2  defines 
three  classes  mentioned  above.  Section  3  describes  Kleene 
class  of  FIG.  First,  fuzzy  logic  function  is  defined  as  the 
representative,  and  then  is  applied  to  granulate  a  given 
image.  Section  3  describes  Lukasiewicz  class  and  the 
effectiveness.  Section  4  shows  the  example  of  Probability 
class.  Section  5  is  the  conclusions. 

2.  Preliminaries 

As  far  as  fuzzy  information  granulation[l]  introduced  by 
Zadeh,  the  granules  are  fuzzy.  Granularity  relates  to 
dumpiness  of  structure,  while  granulation  involves 
decomposition  of  the  whole  into  the  parts.  Figure  1 
shows  the  structure  of  granulation  shown  by  Zadeh.  Thus, 
granulation  can  be  decomposed  into  information 
granulation  and  action  granulation,  the  information 
granulation  can  be  decomposed  into  fuzzy  granulation  and 
crisp  granulation.  Consider  a  granularity  of  an  object, 
Zadeh  shows  the  structure  shown  in  Figure  2.  On 
focusing  on  the  fuzzy  information  granularity,  we  have  to 
clarify  the  granulation  method,  representation  method  of 
the  granularity.  In  this  paper,  we  will  consider  fuzzy 
information  granularity,  especially,  granularity 
representation  method.  Crisp  information  granularity  may 
have  the  same  representation  method.  First,  we  consider 
the  following  three  classes  based  on  logic  operators. 

Let  V  =  [0,  1]  be  the  interval  between  0  and  1,  and  let 
X  =  (xl,  x2,  ... ,  xn)  be  a  set  of  n  variables  where  xi  takes 
on  values  from  U.  A  function  f(X)  is  a  mapping  f  :  Un 
— »ljl.  f(X)  is  said  to  be  an  n- variable  fuzzy  function.  In 
this  paper,  the  following  three  sets  of  fuzzy  logical 
operators  are  considered. 

(1)  <MIN,  MAX.  NOT>  :  Kleene[5]  class 

1)  MIN  :  f(xl,  x2)  =  xIa  x2  =  minimum  (xl,x2), 

2)  MAX  :  f(xl,  x2)  =  xl  v  x2  =  maximum  (xl,  x2). 

3)  NOT:  f(xl)  =  xl=l-xl 
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(2)  <BOLD  INTERSECTION,  UNION,  NOT>[3]  : 

Lukasiewicz  class 

1) BOLD  INTERSECTION  :  f(xl,  x2)  =  xl*x2  = 
maximum  (xl+  x2-l,  0), where  "+"  is  arithmetic  sum, 

2)  BOLD  UNION:  f(xl,  x2)  =  xl  +  x2  =  minimum  (xl+ 
x2,  1),  where  "+”  is  arithmetic  sum, 

3)  NOT:  f(xl)  =  xl=l-xl 

Bold  intersection  and  union  are  also  refered  as  bounded 
product  and  sum,  respectively.  Since  Lukasiewicz 
implication  A->  B  (=  minimum  (  (1-A)  +  B,  1))  can  be 
expressed  by  A  +  B,  we  call  this  Lukasiewicz  class 

(3)  <PROBABILISTICLIKE  PRODUCT,  SUM, 
NOT>:  Probability  class 

1)  PROBABILISTICLIKE  PRODUCT : 
f(xl,  x2)  =  xl  x  x2 

2)  PROBABILISTICLIKE  SUM : 

f(xl,  x2)  =  xl  V  x2  =  xl  +  x2  -  xl  x  x2 

3)  NOT:  f(xl)  =  xl=l-xl 


Figure  1  A  granulation  relation. 


Figure  2  Granularity,  attribute  and  values. 


In  the  following  sections,  we  are  considering  these  fuzzy 
information  granularity  (FIG  for  short)  representation,  and 
then  show  their  representation  methods  and  their 
application  area. 


3.  Representation  of  FIG  on  Kleene 
Class 

This  section  describes  FIG  representation  on  Kleene  class. 
First  we  show  the  definition  of  fuzzy  logic  functions[6]- 
[10],  which  is  a  model  of  Kleene  algebra[5][l  1][12]. 

[Definition  1]  Fuzzy  logic  formulas  are  defined  as  follows: 

(1)  A  variable  Xi  is  a  fuzzy  logic  formula. 

(2)  If  Hi  and  Hj  are  fuzzy  logic  formulas.  Hi  HivHj  and 
HiAHj  are  fuzzy  logic  formulas. 

(3)  The  only  formulas  are  those  given  by  rules  (1)  and  (2). 
Fuzzy  logic  functions  are  defined  as  the  fuzzy  functions 
that  can  be  represented  by  the  fuzzy  logic  formulas. 

All  fuzzy  logic  functions  can  be  represented  by  both  sum 
of  product  and  product  of  sum,  because  Idempotency 
(XaX=X,  XvX=X),  Distributively  X1a(X2vX3)  = 
X1aX2vX1aX3,  X1vX2aX3  =(  X1vX2)a(X1vX3)  and  so 
on  are  approved  in  fuzzy  logic.  But  complementation 
(XaX=0,  XvX=1)  does  not  hold,  whereas  the  Kleene's 
law  (XaX  <YvY)  does  hold.  So  the  term  with  both  Xi 
and  Xi  can  remain.  Such  product  terms  are  called 
complementary  product  terms,  and  alterms  are  called 
complementary  alterms.  The  other  product  terms  called 
simple  product  terms,  and  alterms  are  called  simple 
alterms. 

[Example  1]  Let  the  following  terms  be  the  terms  of 
three-variable  fuzzy  logic  functions.  Simple  product 
terms,  simple  alterms,  complimentary  product  terms, 
complementary  alterms,  minterms  and  maxterms  are  those 
below,  respectively, 

simple  product  terms:  Xl,  X1aX2,  X1aX3 
simple  alterms:  X2vX3,  XlvX2  __ 

complementary  product  terms:  X1a Xl,  Xl aX2a  X2 
complementary  alterms:  X3vX3,  XlvXlvX2 

Any  fuzzy  logic  function  F  can  be  expressed  by  the 
following  form, 

F  =  aj  va2v....van,  (1) 

where  ax  is  one  of  simple  product  term  or  complementary 
product  term. 

F  =  j31A&A....Aj3m,  (2) 

where  $  is  one  of  simple  alterm  or  complementary 

alterm. 

Considering  the  functions  F  as  the  fuzzy  information 
a{  (i=l,  ...,n)  and  /?,  (i=l,  ...,  m)  are  the  fuzzy  granules. 
Here,  the  method  to  identify  and  then  represent  fuzzy  logic 
formula  for  a  given  fuzzy  function,  see  References[l  1]  and 
[12] 

[Example  2]  We  consider  the  granularity  of  the  image  of 
fuzzy  function  F  in  Figure  3.  In  it,  more  white  pixel 
includes  higher  value  in  [0,  1];  The  upper  illustrations 


show  the  shapes  of  projection  by  putting  a  light  from 
direction  of  the  axis  of  X2  in  F,  Gl,  G2,  XI  and  XI, 
X2=l/2.  When  we  represent  the  image  by  fuzzy  logic 
function,  F  can  be  expressed  by  (XI  a  XI)  a  (X2  a 
X2).  In  the  similar  way,  Gl=  XI  a  XI  and  G2=  X2  a 
X2.  Clearly  minimum(Gl,  G2)=F.  The  image  is  thus 
granulated  to  fuzzy  functions  Gl  and  G2. 


IV 


0  XI  1 


0  xi  1 


can  derive  two  granules  (XI  a  XI)  and  (X2  a  X2), 
Moreover,  second  granulation  of  (XI  a  XI)  can  derive  XI 
and  XI,  and  that  of  (X2  a  X2)  can  derive  X2  and  X2. 
Here  the  set  of  (XI  a  XI)  and  (X2  a  X2)  is  complete 
granule  set.  The  set  of  XI  and  XI  is  complete  granule  set 
with  respect  to  (XI  a  XI),  but  is  not  complete  granule  set 
with  respect  to  F.  However,  the  set  of  XI,  XI,  X2  and 
X2  is  a  complete  granule  set  with  respect  to  F. 

Thus,  if  we  can  represent  the  given  object  by  fuzzy  logic 
functions,  we  can  easily  represent  and  then  granulate  it  to 
granules. 


XI 

Gl 


XI 


0  XI  1 


G2 

1 
F 

0  XI  i 
1 

X2 

0  Xl_  1 
XI 


Figure  3  An  illustration  on  FIG  of  fuzzy  logic 
function 


We  define  the  complete  granule  set  below. 

[Definition  2]  Suppose  a  granularity  of  an  object,  the 
granule  set  that  can  completely  express  the  granularity  is 
called  as  complete  granule  set  with  respect  to  the  object. 

Considering  Equations  (1)  and  (2),  the  sets  {  oq,  a2 , 
an }  and  { ft,  >  — »  Pn )  m  complete  granule  sets 
with  respect  to  F.  When  ax  -  Y\  A  Yi  A...Ayk,  the  set 
{  Y\  >  72>  7k )  *s  a  complete  granule  set  with  respect  to 
ax.  We  can  obtain  the  same  discussion  with  respect  to  fa. 


4.  Representation  of  FIG  on  Lukasiewicz 
Class 

The  class  which  can  be  expressed  by  bold  intersection, 
union  and  NOT  is  considered  in  this  section.  The  class 
satisfies  commutativity,  associativity,  identity,  De 
Morgan’s  laws,  excluded-middle  laws  and  involution.  But 
idempotency,  distributivity  and  absorption  are  no  longer. 
Therefore,  it  is  immediate  that,  with  the  operations,  the 
subset  is  not  a  lattice  but  a  complemented  non  distributive 
structure.  When  {0,  1},  the  set  is  a  complemented 
distributive  lattice,  i.e.,  a  Boolean  algebra. 

Many  of  pattern  recognition  and  Sensor-based  detection 
system  problems  may  be  classified  into  this  class,  because 
bold  union  means  arithmetic  summation  under  the 
assumption  that  its  value  is  not  beyond  a  limitation,  where 
we  denote  the  limitation  by  "1".  If  we  consider  a  sensor 
which  detects  some  signal  from  an  object,  the  sensor  has 
physically  detectable  limitation  exactly.  Considering  the 
limitation  by  the  logic  value  ”1",  the  class  would  widely 
be  applicable  to  real  problems. 


Figure  4  An  illustration  of  granulation  on 
Lukasiewicz  class. 


[Example  3]  Consider  the  fuzzy  logic  function  F-  (XI  a  Suppose  a  granulation  of  the  square  in  Figure  4(a). 

XI)  a  (X2  a  X2)  of  Example  2.  First  granulation  of  F  Since  the  whole  area  is  1,  the  limitation  is  set  as  ”1”.  If 
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we  correctly  granulate  it  to  the  parts  A  and  B,  then,  T(A) 
+  T(B)=1  and  T(A)  •  T(B)=0  hold.  If  we  granulate  to  Part 
A'  being  bigger  than  Part  A,  and  Part  B  as  shown  in 
Figure  4(c),  then  T(A')  +  T(B)=1  and  T(A')  •  T(B)  =  the 
lined  area  in  Figure  4(d).  If  we  granulate  to  Part  A”  being 
smaller  than  Part  A,  and  Part  B,  then  T(A")  +  T(B)  = 
lacked  area  and  T(A")  •  T(B)=0.  The  following  relation  thus 
holds. 

[Relation  1 :  Lukasiewicz  Class]  Consider  the  granulation 
of  the  whole  into  parts  xl,  x2,  ...,  xn.  We  normalized  the 
values  xi  (i=l,  ...,  n)  by  the  whole  values,  i.e.,  T(xi)  = 
values(xi)  /  value(the  whole),  then  upper  bound  of  T(xi) 
becomes  1. 

n  n 

If  S+T(xi)<  1  then  (1- £+T(xi))  non  granulated 
i=l  i=l 

granules  exist, 

n  n 

else  if  J"[*T(xi)  >  0  then  f|*T(xi)  over  granules  exist. 
i=l  i=l 

n  n 

else  if  £+T(xi)=l  and  JpT(xi)=0  then  correctly 
i=l  i=l 

granulated:  the  whole  into  the  parts  xl,  x2,  ...»  xn.  The  set 
of  xl,  x2,  ...,  xn  is  a  complete  granule  set  with  respect  to 
the  whole. 

[Example  4]  We  consider  the  granulation  of  human  brain 
MR  image.  We  detected  the  human  brain  MR  image  by  a 
MRI  device.  All  the  sliced  images  are  generated  from  neck 
to  top  of  brain  as  shown  in  Figure  5(a).  Figure  5(b) 
shows  the  granulated  brain  area.  The  whole  brain(white 
matter  and  gray  matter)  is  composed  by  left  cerebral 
hemisphere(LCH),  right  cerebral  hemisphere(RCH), 
cerebellum(CB)  and  a  part  of  brain  stem(BS).  We  set  the 
limitation  of  bold  union  as  the  volume  of  whole  brain, 
i.e.,  all  volumes  of  LCH,  RCH,  CB  and  BS  are  divided  by 
the  volume  of  whole  brain,  and  then  denote  T(A)  = 
Volume(A)/Volume(whole  brain).  Then,  T(LCH)  + 
T(RCH)  +  T(CB)  +  T(the  part  of  BS)  must  be  1,  and 
T(LCH)  •  T(RCH)  •  T(CB)  •  T(the  part  of  BS)  must  be  0. 
Otherwise  we  determine  that  the  granulation  is  wrong. 

4.  Representation  of  FIG  on  Probability 
Class 

In  this  section,  we  will  consider  FIG  on  Probability  class. 

For  Probability  P(A)  and  P(B)  for  possible  events  A 
andB, 

P(A  or  B)  =  P(A)  +  P(B)  -  P(A  and  B), 

P(A  and  B)  =  P(A)  x  P(B/A)  =  P(B)  x  P(A/B),  where 
P(A/B)  means  the  conditional  probability  of  A  occurring, 
given  that  B  has  occurred. 

In  our  probability  class,  on  considering  P(A/B)=P(A) 
and  P(B/A)  =  P(B),  we  derive  "V"  and  Hx\  We  therefore 
call  these  operators  "V"  and  "x"  probabilisticlike  sum  and 
product,  respectively.  Under  these  operations,  "V",  "x"  and 
"NOT”,  the  set  is  only  a  pseudocomplemented  non 
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(a)  A  raw  image  of  human  brain  MR  image. 


(b)  Human  brain  MR  image  granulation  of 
Example  4. 

Figure  5  An  example  on  Lukasiewicz  class. 


distribute  structure.  More  particularly,  "V”  and  "x" 
satisfy  only  commutativity,  associativity,  identity,  De 
Morgan's  laws  and  A  x  0  =0,  A  V  1  =1. 

[Example  5]  Problem:  Suppose  that  we  have  a  research 
meeting  tomorrow.  Please  estimate  whether  the  probability 
of  the  number  of  attendees  is  up  or  down,  compared  to  the 
number  of  attendees  in  last  year  ? 


number  of  attendees 

weather 

|  research  topics 

Place  Date 

Figure  6  A  granularity  structure  of 
Example  5:  Probability  class. 


Table  1  :  Data  table  of  Example  5 


last  year 

this  year 

attendees  # 

30 

(total  researcher  # 
:about  50) 

? 

the  weather 

cloudy 

maybe,  fine 

research 

topics 

fuzzy  logic(FL) 
and  neural 
networks(NN) 

Chaos(CA)  and 
Evolutionary 
computing(EC) 

Date 

first  Thursday, 
November 

first  Thursday, 
November 

Place 

Himeii 

Himeji 

The  granularity  and  an  estimation:  Considering  the 
granularity  of  the  number  of  attendees,  we  can  have  a  FIG 
structure  as  shown  in  Figure  6. 

<granule  of  the  weather>  :  If  it  is  rainy,  the 
attendees  number  may  be  fewer,  compared  to  cloudy  day  or 
fine  day.  In  this  problem,  we  do  not  have  to  take  care  to 
the  weather  granule. 

<granule  of  the  research  topics>:  Assume  that  we 
have  the  following  information  about  50  researchers. 
PX(FL)=0.7,  Pt(NN)=0.4,  Pt(CA)=0.3  and  Pt(EC)=0.5, 
where  Pj(x)  denotes  a  probabilistic  measure  (percentage) 
of  the  number  of  researchers  who  are  interested  in  topics  x 
by  analyzing  past  questionnaire.  Then, 

Pl(FL  or  NN)  =Pt(FL)  V  Pt(NN)  =  0.7  +  0.4  -  0.7  x  0.4 
=  0.82, 

PX(CA  or  EC)  =Pt(CA)  V  Pj(EC)  =  0.3  +  0.5  -  0.3  x 
0.5  =0.65, 

Thus,  Pt(FL  or  NN)  >  Pj(CA  or  EC),  the  fuzzy  vales  of 
attribute  of  the  granule  of  the  research  topics  (CA  and  EC) 
are  smaller  than  that  of  (FL  and  NN). 

<granule  of  the  place  and  date>  :  The  place  and  date 
in  tomorrow  meeting  are  as  same  as  those  in  last  year. 


Considering  the  probability  of  the  number  of  attendees 
according  to  their  granules,  the  degree  will  be  smaller,  i.e., 
we  estimate  that  the  number  of  attendees  is  smaller  than 
the  number  of  attendees  in  last  year. 

Thus,  the  probability  class  is  also  a  meaningful  class 
in  FIG.  This  class  can  express  the  structure  which 
includes  events  of  probabilisticlike  properties.  In  Example 
5,  the  calculation  on  granule  of  research  topics  seems  to  be 
right.  However,  considering  the  granularity  composed  of 
granules,  weather,  research  topics,  place  and  date,  the  FIG 
class  is  not  uniquely  determined,  i.e.,  maybe  it  is  classified 
into  either  Lukasiewicz  class  or  probabilistic  class. 
Moreover,  in  real  world  problem  we  have  the 
combination/fusion  classes  of  the  two  or  three  classes. 

5.  Conclusions 

This  paper  describes  three  classes  of  fuzzy  information 
granularity  and  their  representation  methods:  Kleene  class, 
Lukasiewicz  class  and  probability  class.  As  a  representative 
in  Kleene  class,  fuzzy  logic  functions  are  shown,  and  then 
the  application  to  the  image  granulation  is  demonstrated. 
Lukasiewicz  class  can  express  FIG  including  information 
specified  by  standard  arithmetic  sum  and  substation 
calculation.  As  the  application,  we  show  the  granulation  of 
human  brain  MR  image.  Probabilistic  class  is  introduced 
as  the  class  of  FIG  including  probabilistic  events.  In  the 
real  world  problem,  there  are  many  combination/fusion 
classes  of  them.  However,  this  paper  would  be  a  basis  to 
clarify  the  representation  of  fuzzy  information  granularity. 
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Abstract 

The  paper  presents  a  circuit  description  of  a  MVL 
flip-flop  which  is  implemented  in  MOS  technology.  The 
circuit  has  its  origins  in  a  bipolar  implementation  of  a 
fuzzy  flip-flop .  The  proposed  circuit  not  only  simplifies 
the  original  bipolar  design  but  it  offers  considerable 
potential  for  a  VLSI  implementation .  Simulation  of  the 
circuit  using  HSpice  indicates  that  it  can  also  be 
interpreted  as  a  multiple-valued  logic  flip-flop  for  any 
radix.  This  provides  a  basic  building  block  for  the 
increasing  developments  in  multiple-valued  VLSI  circuit 
design . 

1.  Introduction 

Multiple-valued  logic  (MVL)  is  making  an 
increasing  impact  on  VLSI  circuit  design  [1].  The 
theoretical  advantages  of  MVL  for  pinout  and 
interconnect-limited  IC  design  have  been  well  established 
and  widely  acknowledged,  as  signals  are  allowed  to 
assume  more  than  two  states.  A  number  of  hybrid 
primitives  [2]  have  been  designed  for  the  hardware 
implementation  of  multiple  valued  logic,  eg  the  T-gate  [3] 
and  the  U-gate  [4].  More  recent  developments  have 
introduced  designs  for  a  multiple-valued  logic  flip  flop  [5- 
10]. 

More  recently  there  has  been  a  complementary  trend 
in  the  development  of  circuit  realisations  of  the  fuzzy 
inference  process  [1 1].  An  important  contribution  in  this 
area  was  the  introduction  of  the  concept  of  a  fuzzy  flip- 
flop  [12-16],  which  is  an  extension  of  the  conventional 
binary  flip-flop.  This  research  introduced  the  flip-flop  as 


a  basic  building  block  for  the  realisation  of  a  fuzzy 
processor. 

There  are  obvious  similarities  between  fuzzy  logic 
and  multiple-valued  logic.  In  particular,  the  authors 
believe  that  one  method  of  exploiting  these  similarities  is 
to  use  fuzzy  sets  to  represent  each  level  in  an  MVL 
system.  This  would  have  the  effect  of  removing  the 
distinct  boundaries  associated  with  conventional  bivalent 
sets  and  allow  the  representation  of  each  level  to  overlap. 

The  major  benefit  of  such  an  approach  is  that  a  MVL 
system  could  then  readily  accommodate  differing  numbers 
of  levels  within  the  restrictions  of  the  conventional  power 
rails  and  that  the  noise  margins  required  between  levels 
would  be  facilitated  by  the  approximate  nature  of  the 
fuzzy  based  description.  This  approach  offers 
considerable  potential  for  VLSI  implementations  which 
are  currently  motivated  by  lower  power  designs.  This 
feature  provides  the  main  motivation  for  the  work 
presented  in  this  paper. 

The  paper  begins  with  the  description  of  the  bipolar 
JK  fuzzy  flip-flop  originally  proposed  by  Hirota  [14]. 
The  MOS  implementation  of  this  circuit  is  then  presented 
which  not  only  simplifies  the  original  bipolar  design  but  it 
also  offers  considerable  potential  for  a  VLSI 
implementation.  The  main  contribution  of  this  work 
however,  demonstrates  how  this  circuit  can  also  be 
interpreted  as  a  multiple-valued  logic  flip-flop  for  any 
radix. 

The  paper  is  divided  into  a  number  of  sections. 
Section  2  provides  an  introduction  to  the  concept  of  a 
fuzzy  flip-flop  and  describes  the  MOS  implementation  of 
this  circuit.  Section  3  then  provides  a  multiple-valued 
logic  interpretation  of  the  fuzzy  logic  flip-flop  for  any 
radix.  Finally  section  4  concludes  the  paper  with  a 
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discussion  of  the  results  and  an  outline  of  the  conclusions 
drawn  from  the  work. 

2.  The  fuzzy  JK  flip-flop 

Hirota  defined  the  set  and  reset  type  characteristic 
equations  of  a  binary  JK  flip  flop  as  a  fuzzy  flip-flop 
using  the  concepts  of  fuzzy  negation,  t-norm  and  s-norm 
[14].  The  characteristic  equations  of  the  reset  and  set 
forms  of  the  fuzzy  JK  flip-flop  are  defined  by  the 
following  equations: 

Qs(k+1)  =  (J  s  Q)  t  (Kn  s  Qn)  (1) 

QR(k+l)  =  (J  t  Qn)  s  (Kn  t  Q)  (2) 

where  Qs  and  QR  are  the  set  and  reset  forms  respectively;  t 
=  t-norm  operator;  s=  s-norm  operator;  and  the  superscript 
n  is  used  to  represent  complementation. 

There  are  a  number  of  operators  which  can  be  used 
to  realise  these  functions  but  when  Min,  Max  and  1-  are 
employed  for  the  t-norm,  s-norm  and  complementation 
operators  respectively;  this  simplifies  equations  1  and  2 
to  the  following: 

QR(k+l)  =  {J  Min  (1-Q)}  Max  {(1-K)  Min  Q}  (3) 

Qs(k+1)  =  {J  Max  Q}  Min  {(1  -  K)  Max  (1  -  Q)}  (4) 

Hirota  described  how  equations  3  and  4  can  be 
reduced  to  provide  the  fundamental  equation  of  the  Min- 
Max  type  fiizzy  flip-flop  as  defined  by  equation  5 

Q(t+1)= 

{J  Max  (1-K)}  Min  (J  Max  Q)  Min  {(1-K)  Max  (1-Q)} 

(5) 

This  defining  equation  was  implemented  using 
bipolar  technology  and  realised  using  discrete  components 
(using  38  transistors,  44  resistors,  10  diodes  and  2 
capacitors).  A  complete  circuit  schematic  and  a 
description  of  the  results  obtained  are  provided  in  the 
original  paper  [14]. 

This  bipolar  design  was  modified  by  the  authors,  to 
provide  a  MOS  implementation  of  this  fuzzy  flip-flop 
circuit  (using  62  transistors).  The  operation  of  this  circuit 
as  a  fuzzy  flip-flop  was  confirmed  by  simulation  using 
HSpice.  A  block  diagram  of  the  complete  circuit  of  the 
flip-flop  is  provided  in  Figure  1  at  the  end  of  the  paper. 
In  this  modular  representation,  the  circuit  is  broken  down 
into  a  number  of  functional  blocks  as  described  by  the 
following: 

Min:  Circuit  required  to  implement  the  Min  operation 


Max:  Circuit  required  to  implement  the  Max 

operation 

Negate:  Circuit  required  to  implement  the  complement 
operation 

Trans:  Circuit  required  to  implement  a  transmission 
gate 

Inv:  Circuit  required  to  implement  an  inverter 

The  circuit  realisation  for  the  Min,  Max  and  Negate 
blocks  (which  have  been  copied  from  the  full  working 
schematic)  are  provided  in  Figures  2,  3  and  4  respectively. 
In  these  circuits  all  the  transistors  are  depletion  mode 
devices. 

Figure  2  describes  the  Min  circuit  where  Q3  and  Q5 
are  configured  as  passive  pull-up  or  pull-down  resistors. 
Both  the  input  voltages  (I/Pl  and  I/P2)  and  the  output 
voltage  (O/P)  can  take  values  in  the  range  VEE  to  VCC. 
Consider  the  case  where  I/Pl  is  more  negative  than  I/P2 
and  assume  that  both  Q1  and  Q2  are  operating  above 
pinch  off.  This  assumption  will  always  be  valid  if  the 
gate/source  voltage  of  both  Q1  and  Q2  is  small.  If  the 
transconductance  of  both  Q1  and  Q2  is  maintained  large 
while  that  associated  with  Q3  is  maintained  small,  then  the 
voltage  on  the  gate  of  Q4  will  track  with  the  lowest  input 
voltage.  With  Q4  and  Q5  configured  as  a  voltage 
follower,  the  output  voltage  (O/P)  will  follow  that  gate 
voltage  of  Q4.  Therefore,  the  output  voltage  will  follow 
the  lower  of  the  two  inputs. 


Fig  2  :  Schematic  for  the  Min  circuit  block 

The  Max  circuit  presented  in  Figure  3  operates  in  a 
similar  fashion,  where  the  voltage  at  the  gate  of  Q5  will 
follow  either  the  gate  voltage  of  Q1  or  Q2,  whichever  is 
the  higher.  The  output  circuit  in  Figure  3  is  a  voltage 
follower  and  therefore  the  output  voltage  (O/P)  will 
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follow  the  gate  voltage  of  Q5  and  hence  the  larger  of  the 
two  input  voltages. 


Fig  3  :  Schematic  for  the  Max  circuit  block 

The  circuit  in  Figure  4  consists  of  a  voltage  follower, 
a  constant  current  source  and  a  differential  amplifier  with 
an  active  load.  When  the  input  voltage  (Vin)  increases, 
the  gate  voltage,  and  hence  drain  current,  of  Q9  increases 
resulting  in  a  proportional  decrease  in  the  drain  current  in 
Q8.  In  addition,  because  Q6  and  Q7  operate  in  a  current 
mirror  configuration,  the  drain  current  of  Q7  decreases. 
Hence,  a  current  flows  in  the  output  resistor,  R3,  such  that 
the  output  voltage  Vo  falls. 

However,  if  the  input  voltage  falls,  then  the  gate 
voltage,  and  consequently  drain  current,  of  Q9  falls, 
causing  a  proportional  increase  in  the  drain  current 
associated  with  Q8.  Owing  to  the  action  of  the  current 
mirror,  the  drain  current  of  Q7  increases  causing  a  current 
to  flow  in  the  output  resistor  R3.  The  direction  of  this 
current  is  such  that  the  output  voltage  increases. 
Essentially,  the  circuit  operates  as  an  inverter  and  in  the 
design  process  Rl,  R2  and  R3  can  be  chosen  to  yield  a 
transfer  function  obeying 

Vo  =  Vcc  -  Vin  (6) 

which  is  equivalent  to  the  Negate  operation  for  the  logic 
levels. 

The  resistors  (R1-R3)  can  be  implemented  as  MOS 
transistors  and  are  used  here  simply  to  aid  understanding 
of  the  functionality  of  the  circuit. 

The  MOS  circuit  not  only  reduced  the  component 
count  but  it  is  more  suitable  for  VLSI  implementation. 
The  circuit  was  simulated  using  HSpice  providing  results 
which  agreed  with  the  fundamental  equation  as  defined  by 
equation  5.  The  next  section  describes  how  these  results 


can  be  interpreted  for  a  MVL  flip-flop.  Ongoing  work, 
within  the  research  group,  is  refining  the  design  and 
realising  the  circuit  in  hardware  as  a  full-custom  VLSI 
design. 


vcc 


|VEE 


Fig  4  :  Schematic  for  the  Negate  circuit  block 

3.  The  multiple-valued  logic  interpretation. 

The  flip-flop  circuit  was  simulated  using  the  HSpice 
simulation  package.  Inputs  were  confined  to  the  range  0- 
5  V  and  the  results  confirmed  those  reported  in  the  original 
paper  [14].  In  this  work  the  circuit  was  interpreted  as  a 
MVL  flip-flop  and  the  authors  partitioned  the  input  space 
into  a  number  of  fuzzy  sets  for  each  of  the  logical  inputs. 
This  strategy  used  triangular  fuzzy  sets  with  unity 
partition  as  shown  in  Figs  5,  6  and  7  below.  For  example, 
in  Fig  6  a  logic  1  is  interpreted  as  having  full  membership 
at  a  voltage  level  of  2.5V  and  partial  membership 
extending  to  the  0V  and  5V  limits.  Similarly  all  the  other 
fuzzy  sets  have  the  logical  level  represented  by  a  central 
voltage  with  decreasing  membership  at  all  other  voltages. 

This  representation  was  used  as  a  reference  to 
interpret  the  results  of  the  simulations  during  which  the 
inputs  were  initialised  at  that  voltage  corresponding  to  the 
maximum  set  membership.  In  practice  the  output  voltage 
closely  matched  one  of  these  central  voltages  with  a 


296 


maximum  deviation  of  +0.5V.  Ongoing  work  within  the 
group  is  investigating  the  effect  of  varying  each  input 
within  its  set  description  and  noting  the  tolerance  of  the 
output. 


Fig  5:  Logic  level  assignment  when  radix  =2 


Fig  6:  Logic  level  assignment  when  radix  =3 


The  results  of  the  Spice  simulation  are  summarised 
in  Tables  1,  2  and  3.  These  results  document  the  truth 
tables  of  the  flip-flop  circuit  using  radices  of  2,  3  and  4 
respectively. 


''XJ 

K 

0 

1 

0 

Q 

1 

1 

0 

Q* 

Table  1:  Derived  truth-table  for  MVL  flip-flop  when 
radix=2 


\j 

K 

0 

1 

2 

0 

Q 

Max(Q,l) 

2 

1 

Min(Q,l) 

1 

Max(Q*,l) 

2 

0 

Min(Q*,l) 

Q* 

Table  2:  Derived  truth-table  for  MVL  flip-flop  when 
radix=3 


V 

K\ 

0 

1 

2 

3 

0 

Q 

Max(Q,l) 

Max(Q,2) 

3 

1 

Min(Q,2) 

Max[ 

Min(Q*,l), 

Min(Q,2)] 

2 

Max(Q*,2) 

2 

Min(Q,l) 

1 

Max[ 

Min(Q,l), 

Min(Q*,2)] 

Max(Q*,l) 

3 

0 

Min(QM) 

Min(Q*,2) 

Q* 

Table  3:  Derived  truth-table  for  MVL  flip-flop  when 
radix=4 


where  the  complement  operator  is  defined  as: 

Q*  =  (p  -  Q) 

and  p  is  given  by: 
p  =  R-1 

where  R  is  the  radix. 

The  terms  in  the  truth  table  are  represented  using  the 
Min  and  Max  operators  and  some  examples  of  these 
operations  are  illustrated  in  Table  4  using  a  radix  of  4. 
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Q 

Min  (Q,l) 

Max(Q,2) 

Max[ 

Min(QM), 

Min(Q,2)] 

0 

0 

2 

1 

1 

1 

2 

1 

2 

1 

2 

2 

3 

1 

3 

2 

Table  4:  Examples  of  terms  in  the  MVL  flip-flop  truth 
table  (when  radix=4) 


Examining  Tables  1,  2  and  3  reveal  that  there  is 
clearly  a  pattern  emerging  as  the  next  state  of  the  output  is 
dependent  on  a  combination  of  the  present  state  and  a 
fixed  value.  Indeed  there  is  a  degree  of  symmetry  in  the 
truth  tables.  For  example  if  you  consider  the  truth  table  for 
radix  3  presented  in  Table  2  and  use  DeMorgan’s  theorem 
one  can  show  that  the  following  relationship  applies: 

Min(Q,l)  =  [Max(Q*  1)]* 

These  truth  tables  differ  from  previously  reported 
MVL  designs,  as  they  are  more  dependent  on  the  previous 
value  of  the  output.  However,  there  is  clearly  a  pattern 
emerging  as  the  output  can  be  set  to  a  given  value, 
complemented,  or  made  a  function  of  the  previous  output. 
This  pattern  can  be  used  to  predict  the  truth  table  for  any 
chosen  radix;  this  was  confirmed  by  simulation. 

4.  Discussion  and  Conclusions. 

The  research  presented  in  this  paper  describes  the 
MOS  implementation  of  a  fuzzy  flip-flop  which  is 
appropriate  for  a  VLSI  implementation.  The  simulation 
results  demonstrated  the  successful  operation  of  the  circuit 
as  a  fuzzy  flip-flop. 

The  main  contribution  of  the  work  is  the 
demonstration  that  this  fuzzy  flip-flop  can  also  be 
interpreted  as  a  MVL  flip-flop.  The  authors  described 
how  the  logic  levels  were  represented  using  fuzzy  sets  and 
simulation  results  were  presented  to  confirm  the  circuit’s 
operation.  The  results  indicate  a  clear  pattern  emerging  in 
the  truth  tables  for  any  chosen  radix. 

This  paper  reports  on  the  initial  investigations  into 
the  use  of  fuzzy  logic  and  fuzzy  set  descriptors  in  the 
interpretation  of  logic  levels  for  MVL.  Such  an 
interpretation  offers  considerable  potential  as  the 
application  of  MVL  logic  circuit  is  becoming  increasingly 
important  in  digital  signal  processing  and  computing 
applications.  The  current  research  has  not  yet  exploited 


the  noise  immunity  capabilities  of  the  fuzzy  reasoning 
process  in  this  interpretation  but  it  is  the  authors  opinion 
that  this  will  be  employed  when  such  sub-circuits  are 
employed  in  a  complete  design.  This  area  of  research  is 
being  currently  investigated  by  staff  of  the  Intelligent 
Systems  Engineering  Laboratory. 
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